From mboxrd@z Thu Jan 1 00:00:00 1970
From: bugzilla-daemon@bugzilla.kernel.org
Subject: [Bug 95101] scsi/mpt2sas: setpci reset results in kernel oops
Date: Thu, 19 Mar 2015 21:46:04 +0000
Message-ID:
References:
Mime-Version: 1.0
Content-Type: text/plain; charset="UTF-8"
Content-Transfer-Encoding: 7bit
Return-path:
Received: from mail.kernel.org ([198.145.29.136]:34760 "EHLO mail.kernel.org" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1751569AbbCSVqH (ORCPT ); Thu, 19 Mar 2015 17:46:07 -0400
Received: from mail.kernel.org (localhost [127.0.0.1]) by mail.kernel.org (Postfix) with ESMTP id 73ACB20522 for ; Thu, 19 Mar 2015 21:46:06 +0000 (UTC)
Received: from bugzilla1.web.kernel.org (bugzilla1.web.kernel.org [172.20.200.51]) by mail.kernel.org (Postfix) with ESMTP id D0F7820525 for ; Thu, 19 Mar 2015 21:46:04 +0000 (UTC)
In-Reply-To:
Sender: linux-scsi-owner@vger.kernel.org
List-Id: linux-scsi@vger.kernel.org
To: linux-scsi@vger.kernel.org

https://bugzilla.kernel.org/show_bug.cgi?id=95101

Joe Lawrence changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
                 CC|                            |joe.lawrence@stratus.com

--- Comment #1 from Joe Lawrence ---
From the /var/log/messages attachment:

Mar 16 01:13:10 RHEL63 kernel: mpt2sas1: _base_fault_reset_work: Running mpt2sas_dead_ioc thread success !!!!
...
Mar 16 01:13:20 RHEL63 kernel: mpt2sas1: _scsih_ir_shutdown: timeout
Mar 16 01:13:20 RHEL63 kernel: mpt2sas1: removing handle(0x0024), wwid(0x0c4e8a1c03a9b742)

indicates that _scsih_remove was called when the driver's watchdog detected
that the device was misbehaving. Driver device removal invokes:

  mpt2sas_base_detach
    mpt2sas_base_free_resources
      iounmap(ioc->chip)

setting the stage for the crash:

Mar 16 01:13:37 RHEL63 kernel: BUG: unable to handle kernel paging request at ffffc900171e0000
Mar 16 01:13:37 RHEL63 kernel: IP: [] mpt2sas_base_get_iocstate+0x10/0x30 [mpt2sas]
...
Mar 16 01:13:37 RHEL63 kernel: RAX: ffffc900171e0000 RBX: ffff88105a0aa788 RCX: 0000000000004fdc

where mpt2sas_base_get_iocstate was probably calling
readl(&ioc->chip->Doorbell)

So it would seem that the mpt2sas ioctl code (step 5, I think) isn't
synchronized against device removal.
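
To illustrate the kind of synchronization that appears to be missing, here is a
minimal userspace sketch, not driver code. Every name in it (fake_ioc,
fake_chip, fake_base_free_resources, fake_get_iocstate) is hypothetical, and a
pthread mutex stands in for whatever locking the driver would actually use. The
idea is only that the removal path invalidates the register mapping under a
lock, and the reader checks for that under the same lock before touching the
doorbell:

```c
#include <assert.h>
#include <errno.h>
#include <pthread.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical stand-in for the mapped register block (not the real layout). */
struct fake_chip {
    uint32_t doorbell;
};

/* Hypothetical stand-in for the per-adapter structure. */
struct fake_ioc {
    pthread_mutex_t lock;    /* serializes register access against removal */
    struct fake_chip *chip;  /* NULL once the "mapping" has been torn down */
};

/* Removal path: drop the mapping under the lock so no reader can race it. */
void fake_base_free_resources(struct fake_ioc *ioc)
{
    assert(ioc != NULL);
    pthread_mutex_lock(&ioc->lock);
    ioc->chip = NULL;        /* stands in for iounmap(ioc->chip) */
    pthread_mutex_unlock(&ioc->lock);
}

/* ioctl-side path: read the doorbell only while the mapping is still valid. */
int fake_get_iocstate(struct fake_ioc *ioc, uint32_t *state)
{
    int rc = 0;

    assert(ioc != NULL && state != NULL);
    pthread_mutex_lock(&ioc->lock);
    if (ioc->chip == NULL)
        rc = -ENODEV;        /* adapter already removed: fail, do not touch it */
    else
        *state = ioc->chip->doorbell;  /* stands in for readl(&ioc->chip->Doorbell) */
    pthread_mutex_unlock(&ioc->lock);
    return rc;
}
```

With this shape, a caller that arrives after removal gets -ENODEV back instead
of dereferencing a stale mapping. Whether a mutex, a removal flag, or some
other mechanism is the right fit for the real driver is a separate question
that this sketch does not try to answer.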