From mboxrd@z Thu Jan 1 00:00:00 1970 From: Alan Cox Subject: Re: Core scsi layer crashes in 2.6.8.1 Date: Wed, 29 Sep 2004 14:24:10 +0100 Sender: linux-scsi-owner@vger.kernel.org Message-ID: <1096464245.15907.27.camel@localhost.localdomain> References: <1096401785.13936.5.camel@localhost.localdomain> <1096467125.2028.11.camel@mulgrave> Mime-Version: 1.0 Content-Type: text/plain Content-Transfer-Encoding: 7bit Return-path: Received: from clock-tower.bc.nu ([81.2.110.250]:8330 "EHLO localhost.localdomain") by vger.kernel.org with ESMTP id S268501AbUI2O0g (ORCPT ); Wed, 29 Sep 2004 10:26:36 -0400 In-Reply-To: <1096467125.2028.11.camel@mulgrave> List-Id: linux-scsi@vger.kernel.org To: James Bottomley Cc: Linux Kernel Mailing List , SCSI Mailing List On Mer, 2004-09-29 at 15:11, James Bottomley wrote: > What was the oops? > > I have a theory that we should be taking a device reference before > waking up the error handler, otherwise host removal can race with error > handling. The sequence I scribbled down from the console was Illegal state transition Cancel->Offline Badness in scsi_device_set_state scsi_device_set_state scsi_unjam_host scsi_error_handler badness in kref_get kobject_get, get_device, scsi_request_fn blk_insert_request, scsi_queue_insert scsi_eh_flush_done_q, scsi_unjam_host scsi_error_handler OOPS scsi_device_dev_release device_release kobject_cleanup kobject_release kref_put scsi_request_fn Alan