public inbox for linux-scsi@vger.kernel.org
* Re: [Bugme-new] [Bug 9018] New: Kernel bug in aic94xx driver shipped with kernel 2.6.21.7
       [not found] <bug-9018-10286@http.bugzilla.kernel.org/>
@ 2007-09-14 20:56 ` Andrew Morton
  2007-09-14 21:13   ` Jeff Garzik
  2007-09-14 21:22   ` James Bottomley
  0 siblings, 2 replies; 4+ messages in thread
From: Andrew Morton @ 2007-09-14 20:56 UTC
  To: linux-scsi; +Cc: bugme-new, martijn, Darrick J. Wong

On Fri, 14 Sep 2007 07:11:54 -0700 (PDT)
bugme-daemon@bugzilla.kernel.org wrote:

> http://bugzilla.kernel.org/show_bug.cgi?id=9018
> 
>            Summary: Kernel bug in aic94xx driver shipped with kernel
>                     2.6.21.7
>            Product: Drivers
>            Version: 2.5
>      KernelVersion: 2.6.21.7
>           Platform: All
>         OS/Version: Linux
>               Tree: Mainline
>             Status: NEW
>           Severity: normal
>           Priority: P1
>          Component: Other
>         AssignedTo: drivers_other@kernel-bugs.osdl.org
>         ReportedBy: martijn@databoss.nl
> 
> 
> While rebuilding a MD raid5, every time I try to rebuild:
> 
> -- START DUMP --
> RAID5 conf printout:
>  --- rd:3 wd:2
>  disk 0, o:1, dev:sdb1
>  disk 1, o:1, dev:sdc1
>  disk 2, o:1, dev:sdd1
> md: recovery of RAID array md0
> md: minimum _guaranteed_  speed: 1000 KB/sec/disk.
> md: using maximum available idle IO bandwidth (but not more than 200000 KB/sec)
> for recovery.
> md: using 128k window, over a total of 71681920 blocks.
> ------------[ cut here ]------------
> kernel BUG at drivers/scsi/aic94xx/aic94xx_hwi.h:354!

whee!  That's BUG_ON(!list_empty(&ascb->list));

yet another scsi driver with no entry in MAINTAINERS.  Darrick, maybe?

> invalid opcode: 0000 [1] SMP
> CPU 1
> Modules linked in: aic94xx
> Pid: 1182, comm: scsi_eh_2 Not tainted 2.6.21.7 #2
> RIP: 0010:[<ffffffff8800a78d>]  [<ffffffff8800a78d>]
> :aic94xx:asd_abort_task+0x3c3/0x4d6
> RSP: 0018:ffff81022e4c5d80  EFLAGS: 00010287
> RAX: 0000000000000000 RBX: ffff810228226380 RCX: 0000000000000001
> RDX: ffff810228226410 RSI: 0000000000000282 RDI: ffff8102280d1098
> RBP: ffff81022fb08000 R08: 0000000000000000 R09: 0000000000000001
> R10: 0000000000000001 R11: ffff81022e3b4480 R12: ffff810228224c80
> R13: 0000000000000000 R14: ffff8102280d1098 R15: ffff810228226380
> FS:  0000000000000000(0000) GS:ffff81022fc3a940(0000) knlGS:0000000000000000
> CS:  0010 DS: 0018 ES: 0018 CR0: 000000008005003b
> CR2: 00000000005b2b50 CR3: 0000000000201000 CR4: 00000000000006e0
> Process scsi_eh_2 (pid: 1182, threadinfo ffff81022e4c4000, task
> ffff81022f470440)
> Stack:  ffff81022f470440 0000018de129b192 000000002fe8c100 ffff81022fb08108
>  0000000000000000 ffff8102280d1080 ffff8102280d1098 ffff81022e4c5eb0
>  ffff810228262a80 ffffffff803c4f1b 0000000000000000 ffff81022e4c5e20
> Call Trace:
>  [<ffffffff803c4f1b>] sas_scsi_recover_host+0x1eb/0x690
>  [<ffffffff803b22c5>] scsi_error_handler+0xa5/0x300
>  [<ffffffff803b2220>] scsi_error_handler+0x0/0x300
>  [<ffffffff80288d09>] keventd_create_kthread+0x0/0x65
>  [<ffffffff802315fb>] kthread+0xcb/0xf5
>  [<ffffffff8025cf38>] child_rip+0xa/0x12
>  [<ffffffff80288d09>] keventd_create_kthread+0x0/0x65
>  [<ffffffff80231530>] kthread+0x0/0xf5
>  [<ffffffff8025cf2e>] child_rip+0x0/0x12
> 
> 
> Code: 0f 0b eb fe 48 8d bd f0 41 00 00 e8 c6 91 25 f8 48 89 c6 8b
> RIP  [<ffffffff8800a78d>] :aic94xx:asd_abort_task+0x3c3/0x4d6
>  RSP <ffff81022e4c5d80>
> -- END DUMP --
> 
> 
> -- 
> Configure bugmail: http://bugzilla.kernel.org/userprefs.cgi?tab=email
> ------- You are receiving this mail because: -------
> You are on the CC list for the bug, or are watching someone who is.


* Re: [Bugme-new] [Bug 9018] New: Kernel bug in aic94xx driver shipped with kernel 2.6.21.7
  2007-09-14 20:56 ` [Bugme-new] [Bug 9018] New: Kernel bug in aic94xx driver shipped with kernel 2.6.21.7 Andrew Morton
@ 2007-09-14 21:13   ` Jeff Garzik
  2007-09-14 21:22   ` James Bottomley
  1 sibling, 0 replies; 4+ messages in thread
From: Jeff Garzik @ 2007-09-14 21:13 UTC
  To: Andrew Morton; +Cc: linux-scsi, bugme-new, martijn, Darrick J. Wong, Gilbert Wu

Andrew Morton wrote:
> On Fri, 14 Sep 2007 07:11:54 -0700 (PDT)
> bugme-daemon@bugzilla.kernel.org wrote:
> 
>> http://bugzilla.kernel.org/show_bug.cgi?id=9018
>>
>>            Summary: Kernel bug in aic94xx driver shipped with kernel
>>                     2.6.21.7
>>            Product: Drivers
>>            Version: 2.5
>>      KernelVersion: 2.6.21.7
>>           Platform: All
>>         OS/Version: Linux
>>               Tree: Mainline
>>             Status: NEW
>>           Severity: normal
>>           Priority: P1
>>          Component: Other
>>         AssignedTo: drivers_other@kernel-bugs.osdl.org
>>         ReportedBy: martijn@databoss.nl
>>
>>
>> While rebuilding a MD raid5, every time I try to rebuild:
>>
>> -- START DUMP --
>> RAID5 conf printout:
>>  --- rd:3 wd:2
>>  disk 0, o:1, dev:sdb1
>>  disk 1, o:1, dev:sdc1
>>  disk 2, o:1, dev:sdd1
>> md: recovery of RAID array md0
>> md: minimum _guaranteed_  speed: 1000 KB/sec/disk.
>> md: using maximum available idle IO bandwidth (but not more than 200000 KB/sec)
>> for recovery.
>> md: using 128k window, over a total of 71681920 blocks.
>> ------------[ cut here ]------------
>> kernel BUG at drivers/scsi/aic94xx/aic94xx_hwi.h:354!
> 
> whee!  That's BUG_ON(!list_empty(&ascb->list));
> 
> yet another scsi driver with no entry in MAINTAINERS.  Darrick, maybe?

Quite honestly, that's the reality of the situation for (IMO) the 
majority of SCSI drivers.  They just don't really have maintainers at 
all, so it winds up falling onto the subsystem maintainer(s) by default.

Gilbert_Wu@adaptec.com just posted a patch to the driver, so he should 
probably be kept in the loop.

	Jeff




* Re: [Bugme-new] [Bug 9018] New: Kernel bug in aic94xx driver shipped with kernel 2.6.21.7
  2007-09-14 20:56 ` [Bugme-new] [Bug 9018] New: Kernel bug in aic94xx driver shipped with kernel 2.6.21.7 Andrew Morton
  2007-09-14 21:13   ` Jeff Garzik
@ 2007-09-14 21:22   ` James Bottomley
  2007-09-18 20:14     ` Martijn Prummel
  1 sibling, 1 reply; 4+ messages in thread
From: James Bottomley @ 2007-09-14 21:22 UTC
  To: Andrew Morton; +Cc: linux-scsi, bugme-new, martijn, Darrick J. Wong

On Fri, 2007-09-14 at 13:56 -0700, Andrew Morton wrote:
> On Fri, 14 Sep 2007 07:11:54 -0700 (PDT)
> bugme-daemon@bugzilla.kernel.org wrote:
> 
> > http://bugzilla.kernel.org/show_bug.cgi?id=9018
> > 
> >            Summary: Kernel bug in aic94xx driver shipped with kernel
> >                     2.6.21.7
> >            Product: Drivers
> >            Version: 2.5
> >      KernelVersion: 2.6.21.7
> >           Platform: All
> >         OS/Version: Linux
> >               Tree: Mainline
> >             Status: NEW
> >           Severity: normal
> >           Priority: P1
> >          Component: Other
> >         AssignedTo: drivers_other@kernel-bugs.osdl.org
> >         ReportedBy: martijn@databoss.nl
> > 
> > 
> > While rebuilding a MD raid5, every time I try to rebuild:
> > 
> > -- START DUMP --
> > RAID5 conf printout:
> >  --- rd:3 wd:2
> >  disk 0, o:1, dev:sdb1
> >  disk 1, o:1, dev:sdc1
> >  disk 2, o:1, dev:sdd1
> > md: recovery of RAID array md0
> > md: minimum _guaranteed_  speed: 1000 KB/sec/disk.
> > md: using maximum available idle IO bandwidth (but not more than 200000 KB/sec)
> > for recovery.
> > md: using 128k window, over a total of 71681920 blocks.
> > ------------[ cut here ]------------
> > kernel BUG at drivers/scsi/aic94xx/aic94xx_hwi.h:354!
> 
> whee!  That's BUG_ON(!list_empty(&ascb->list));

Yes; it means the task was still in use when we tried to free it.  I
surmise that this is the tascb not the ascb in asd_abort_task(). What
this seems to indicate is some sort of race between the abort completing
the task and the owning entity taking it off the sequencer list.

I don't understand this piece of the driver well enough yet to pin down
a more definite cause.
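
To make the invariant concrete, here is a minimal userspace sketch of
the check that fired. It is not the driver's code: the list helpers
below are simplified stand-ins for the kernel's list API, and
struct demo_ascb with its alloc/can_free/free functions are
hypothetical names invented for illustration. The only thing taken
from the thread is the condition itself, that a control block must be
on no list when it is freed.

```c
#include <assert.h>
#include <stdbool.h>
#include <stdlib.h>

/* Userspace stand-in for the kernel's circular doubly linked list. */
struct list_head {
	struct list_head *next, *prev;
};

/* A node that points at itself is on no list. */
static void init_list_head(struct list_head *h)
{
	h->next = h;
	h->prev = h;
}

static bool list_is_empty(const struct list_head *h)
{
	return h->next == h;
}

/* Link node in front of head, i.e. at the tail of the list. */
static void list_add_tail_node(struct list_head *node, struct list_head *head)
{
	node->prev = head->prev;
	node->next = head;
	head->prev->next = node;
	head->prev = node;
}

/* Unlink node and make it self-pointing again (like list_del_init). */
static void list_del_init_node(struct list_head *node)
{
	node->prev->next = node->next;
	node->next->prev = node->prev;
	init_list_head(node);
}

/* Hypothetical control block; only the list linkage matters here. */
struct demo_ascb {
	struct list_head list;
	int tag;
};

static struct demo_ascb *demo_ascb_alloc(int tag)
{
	struct demo_ascb *a = malloc(sizeof(*a));

	if (!a)
		return NULL;
	init_list_head(&a->list);
	a->tag = tag;
	return a;
}

/* True only when the block has been taken off every list. */
static bool demo_ascb_can_free(const struct demo_ascb *a)
{
	return list_is_empty(&a->list);
}

static void demo_ascb_free(struct demo_ascb *a)
{
	/* Userspace analogue of BUG_ON(!list_empty(&ascb->list)). */
	assert(demo_ascb_can_free(a));
	free(a);
}
```

In this sketch, calling demo_ascb_free() on a block that was added to a
pending list and never unlinked trips the assert. That is the ordering
problem suspected above: one path frees the block before the other
path has taken it off the list.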

> yet another scsi driver with no entry in MAINTAINERS.  Darrick, maybe?

Gilbert Wu should be taking over eventually, but for now it's a bit
unmaintained.

James




* Re: [Bugme-new] [Bug 9018] New: Kernel bug in aic94xx driver shipped with kernel 2.6.21.7
  2007-09-14 21:22   ` James Bottomley
@ 2007-09-18 20:14     ` Martijn Prummel
  0 siblings, 0 replies; 4+ messages in thread
From: Martijn Prummel @ 2007-09-18 20:14 UTC
  To: James Bottomley; +Cc: Andrew Morton, linux-scsi, bugme-new, Darrick J. Wong


I applied the patch and it's rebuilding now. It also used to crash
during a rebuild sometimes, but that seems to be gone already.
I'll stress it for some time and will let you all know whether it
crashes again.

Thanks a lot for your help thus far!

Best regards,

-- 
Martijn Prummel


