* Re: [Bugme-new] [Bug 9018] New: Kernel bug in aic94xx driver shipped with kernel 2.6.21.7
[not found] <bug-9018-10286@http.bugzilla.kernel.org/>
@ 2007-09-14 20:56 ` Andrew Morton
2007-09-14 21:13 ` Jeff Garzik
2007-09-14 21:22 ` James Bottomley
0 siblings, 2 replies; 4+ messages in thread
From: Andrew Morton @ 2007-09-14 20:56 UTC (permalink / raw)
To: linux-scsi; +Cc: bugme-new, martijn, Darrick J. Wong
On Fri, 14 Sep 2007 07:11:54 -0700 (PDT)
bugme-daemon@bugzilla.kernel.org wrote:
> http://bugzilla.kernel.org/show_bug.cgi?id=9018
>
> Summary: Kernel bug in aic94xx driver shipped with kernel
> 2.6.21.7
> Product: Drivers
> Version: 2.5
> KernelVersion: 2.6.21.7
> Platform: All
> OS/Version: Linux
> Tree: Mainline
> Status: NEW
> Severity: normal
> Priority: P1
> Component: Other
> AssignedTo: drivers_other@kernel-bugs.osdl.org
> ReportedBy: martijn@databoss.nl
>
>
> While rebuilding a MD raid5, every time I try to rebuild:
>
> -- START DUMP --
> RAID5 conf printout:
> --- rd:3 wd:2
> disk 0, o:1, dev:sdb1
> disk 1, o:1, dev:sdc1
> disk 2, o:1, dev:sdd1
> md: recovery of RAID array md0
> md: minimum _guaranteed_ speed: 1000 KB/sec/disk.
> md: using maximum available idle IO bandwidth (but not more than 200000 KB/sec)
> for recovery.
> md: using 128k window, over a total of 71681920 blocks.
> ------------[ cut here ]------------
> kernel BUG at drivers/scsi/aic94xx/aic94xx_hwi.h:354!
whee! That's BUG_ON(!list_empty(&ascb->list));
yet anoher scsi driver with no entry in MAINTAINERS. Darrick, maybe?
> invalid opcode: 0000 [1] SMP
> CPU 1
> Modules linked in: aic94xx
> Pid: 1182, comm: scsi_eh_2 Not tainted 2.6.21.7 #2
> RIP: 0010:[<ffffffff8800a78d>] [<ffffffff8800a78d>]
> :aic94xx:asd_abort_task+0x3c3/0x4d6
> RSP: 0018:ffff81022e4c5d80 EFLAGS: 00010287
> RAX: 0000000000000000 RBX: ffff810228226380 RCX: 0000000000000001
> RDX: ffff810228226410 RSI: 0000000000000282 RDI: ffff8102280d1098
> RBP: ffff81022fb08000 R08: 0000000000000000 R09: 0000000000000001
> R10: 0000000000000001 R11: ffff81022e3b4480 R12: ffff810228224c80
> R13: 0000000000000000 R14: ffff8102280d1098 R15: ffff810228226380
> FS: 0000000000000000(0000) GS:ffff81022fc3a940(0000) knlGS:0000000000000000
> CS: 0010 DS: 0018 ES: 0018 CR0: 000000008005003b
> CR2: 00000000005b2b50 CR3: 0000000000201000 CR4: 00000000000006e0
> Process scsi_eh_2 (pid: 1182, threadinfo ffff81022e4c4000, task
> ffff81022f470440)
> Stack: ffff81022f470440 0000018de129b192 000000002fe8c100 ffff81022fb08108
> 0000000000000000 ffff8102280d1080 ffff8102280d1098 ffff81022e4c5eb0
> ffff810228262a80 ffffffff803c4f1b 0000000000000000 ffff81022e4c5e20
> Call Trace:
> [<ffffffff803c4f1b>] sas_scsi_recover_host+0x1eb/0x690
> [<ffffffff803b22c5>] scsi_error_handler+0xa5/0x300
> [<ffffffff803b2220>] scsi_error_handler+0x0/0x300
> [<ffffffff80288d09>] keventd_create_kthread+0x0/0x65
> [<ffffffff802315fb>] kthread+0xcb/0xf5
> [<ffffffff8025cf38>] child_rip+0xa/0x12
> [<ffffffff80288d09>] keventd_create_kthread+0x0/0x65
> [<ffffffff80231530>] kthread+0x0/0xf5
> [<ffffffff8025cf2e>] child_rip+0x0/0x12
>
>
> Code: 0f 0b eb fe 48 8d bd f0 41 00 00 e8 c6 91 25 f8 48 89 c6 8b
> RIP [<ffffffff8800a78d>] :aic94xx:asd_abort_task+0x3c3/0x4d6
> RSP <ffff81022e4c5d80>
> -- END DUMP --
>
>
> --
> Configure bugmail: http://bugzilla.kernel.org/userprefs.cgi?tab=email
> ------- You are receiving this mail because: -------
> You are on the CC list for the bug, or are watching someone who is.
^ permalink raw reply [flat|nested] 4+ messages in thread
* Re: [Bugme-new] [Bug 9018] New: Kernel bug in aic94xx driver shipped with kernel 2.6.21.7
2007-09-14 20:56 ` [Bugme-new] [Bug 9018] New: Kernel bug in aic94xx driver shipped with kernel 2.6.21.7 Andrew Morton
@ 2007-09-14 21:13 ` Jeff Garzik
2007-09-14 21:22 ` James Bottomley
1 sibling, 0 replies; 4+ messages in thread
From: Jeff Garzik @ 2007-09-14 21:13 UTC (permalink / raw)
To: Andrew Morton; +Cc: linux-scsi, bugme-new, martijn, Darrick J. Wong, Gilbert Wu
Andrew Morton wrote:
> On Fri, 14 Sep 2007 07:11:54 -0700 (PDT)
> bugme-daemon@bugzilla.kernel.org wrote:
>
>> http://bugzilla.kernel.org/show_bug.cgi?id=9018
>>
>> Summary: Kernel bug in aic94xx driver shipped with kernel
>> 2.6.21.7
>> Product: Drivers
>> Version: 2.5
>> KernelVersion: 2.6.21.7
>> Platform: All
>> OS/Version: Linux
>> Tree: Mainline
>> Status: NEW
>> Severity: normal
>> Priority: P1
>> Component: Other
>> AssignedTo: drivers_other@kernel-bugs.osdl.org
>> ReportedBy: martijn@databoss.nl
>>
>>
>> While rebuilding a MD raid5, every time I try to rebuild:
>>
>> -- START DUMP --
>> RAID5 conf printout:
>> --- rd:3 wd:2
>> disk 0, o:1, dev:sdb1
>> disk 1, o:1, dev:sdc1
>> disk 2, o:1, dev:sdd1
>> md: recovery of RAID array md0
>> md: minimum _guaranteed_ speed: 1000 KB/sec/disk.
>> md: using maximum available idle IO bandwidth (but not more than 200000 KB/sec)
>> for recovery.
>> md: using 128k window, over a total of 71681920 blocks.
>> ------------[ cut here ]------------
>> kernel BUG at drivers/scsi/aic94xx/aic94xx_hwi.h:354!
>
> whee! That's BUG_ON(!list_empty(&ascb->list));
>
> yet anoher scsi driver with no entry in MAINTAINERS. Darrick, maybe?
Quite honestly, that's the reality of the situation for (IMO) the
majority of SCSI drivers. They just don't really have maintainers at
all, so it winds up falling onto the subsystem maintainer(s) by default.
Gilbert_Wu@adaptec.com just posted a patch to the driver, so he should
probably be kept in the loop.
Jeff
^ permalink raw reply [flat|nested] 4+ messages in thread
* Re: [Bugme-new] [Bug 9018] New: Kernel bug in aic94xx driver shipped with kernel 2.6.21.7
2007-09-14 20:56 ` [Bugme-new] [Bug 9018] New: Kernel bug in aic94xx driver shipped with kernel 2.6.21.7 Andrew Morton
2007-09-14 21:13 ` Jeff Garzik
@ 2007-09-14 21:22 ` James Bottomley
2007-09-18 20:14 ` Martijn Prummel
1 sibling, 1 reply; 4+ messages in thread
From: James Bottomley @ 2007-09-14 21:22 UTC (permalink / raw)
To: Andrew Morton; +Cc: linux-scsi, bugme-new, martijn, Darrick J. Wong
On Fri, 2007-09-14 at 13:56 -0700, Andrew Morton wrote:
> On Fri, 14 Sep 2007 07:11:54 -0700 (PDT)
> bugme-daemon@bugzilla.kernel.org wrote:
>
> > http://bugzilla.kernel.org/show_bug.cgi?id=9018
> >
> > Summary: Kernel bug in aic94xx driver shipped with kernel
> > 2.6.21.7
> > Product: Drivers
> > Version: 2.5
> > KernelVersion: 2.6.21.7
> > Platform: All
> > OS/Version: Linux
> > Tree: Mainline
> > Status: NEW
> > Severity: normal
> > Priority: P1
> > Component: Other
> > AssignedTo: drivers_other@kernel-bugs.osdl.org
> > ReportedBy: martijn@databoss.nl
> >
> >
> > While rebuilding a MD raid5, every time I try to rebuild:
> >
> > -- START DUMP --
> > RAID5 conf printout:
> > --- rd:3 wd:2
> > disk 0, o:1, dev:sdb1
> > disk 1, o:1, dev:sdc1
> > disk 2, o:1, dev:sdd1
> > md: recovery of RAID array md0
> > md: minimum _guaranteed_ speed: 1000 KB/sec/disk.
> > md: using maximum available idle IO bandwidth (but not more than 200000 KB/sec)
> > for recovery.
> > md: using 128k window, over a total of 71681920 blocks.
> > ------------[ cut here ]------------
> > kernel BUG at drivers/scsi/aic94xx/aic94xx_hwi.h:354!
>
> whee! That's BUG_ON(!list_empty(&ascb->list));
Yes; it means the task was still in use when we tried to free it. I
surmise that this is the tascb not the ascb in asd_abort_task(). What
this seems to indicate is some sort of race between the abort completing
the task and the owning entity taking it off the sequencer list.
I don't understand this piece of the driver enough yet to fix a more
definite cause.
> yet anoher scsi driver with no entry in MAINTAINERS. Darrick, maybe?
Gilbert Wu should be taking over eventually, but for now it's a bit
unmaintained.
James
^ permalink raw reply [flat|nested] 4+ messages in thread
* Re: [Bugme-new] [Bug 9018] New: Kernel bug in aic94xx driver shipped with kernel 2.6.21.7
2007-09-14 21:22 ` James Bottomley
@ 2007-09-18 20:14 ` Martijn Prummel
0 siblings, 0 replies; 4+ messages in thread
From: Martijn Prummel @ 2007-09-18 20:14 UTC (permalink / raw)
To: James Bottomley; +Cc: Andrew Morton, linux-scsi, bugme-new, Darrick J. Wong
I applied the patch and it's rebuilding now.. It used to crash during
a rebuild sometimes, aswell. But this seems to have been gone already.
I'll stress it for some time and will let you all know whether it
crashes again.
Thanks a lot for your help thusfar!
Best regards,
--
Martijn Prummel
On Sep 14, 2007, at 11:22 PM, James Bottomley wrote:
> On Fri, 2007-09-14 at 13:56 -0700, Andrew Morton wrote:
>> On Fri, 14 Sep 2007 07:11:54 -0700 (PDT)
>> bugme-daemon@bugzilla.kernel.org wrote:
>>
>>> http://bugzilla.kernel.org/show_bug.cgi?id=9018
>>>
>>> Summary: Kernel bug in aic94xx driver shipped with kernel
>>> 2.6.21.7
>>> Product: Drivers
>>> Version: 2.5
>>> KernelVersion: 2.6.21.7
>>> Platform: All
>>> OS/Version: Linux
>>> Tree: Mainline
>>> Status: NEW
>>> Severity: normal
>>> Priority: P1
>>> Component: Other
>>> AssignedTo: drivers_other@kernel-bugs.osdl.org
>>> ReportedBy: martijn@databoss.nl
>>>
>>>
>>> While rebuilding a MD raid5, every time I try to rebuild:
>>>
>>> -- START DUMP --
>>> RAID5 conf printout:
>>> --- rd:3 wd:2
>>> disk 0, o:1, dev:sdb1
>>> disk 1, o:1, dev:sdc1
>>> disk 2, o:1, dev:sdd1
>>> md: recovery of RAID array md0
>>> md: minimum _guaranteed_ speed: 1000 KB/sec/disk.
>>> md: using maximum available idle IO bandwidth (but not more than
>>> 200000 KB/sec)
>>> for recovery.
>>> md: using 128k window, over a total of 71681920 blocks.
>>> ------------[ cut here ]------------
>>> kernel BUG at drivers/scsi/aic94xx/aic94xx_hwi.h:354!
>>
>> whee! That's BUG_ON(!list_empty(&ascb->list));
>
> Yes; it means the task was still in use when we tried to free it. I
> surmise that this is the tascb not the ascb in asd_abort_task(). What
> this seems to indicate is some sort of race between the abort
> completing
> the task and the owning entity taking it off the sequencer list.
>
> I don't understand this piece of the driver enough yet to fix a more
> definite cause.
>
>> yet anoher scsi driver with no entry in MAINTAINERS. Darrick, maybe?
>
> Gilbert Wu should be taking over eventually, but for now it's a bit
> unmaintained.
>
> James
>
>
^ permalink raw reply [flat|nested] 4+ messages in thread
end of thread, other threads:[~2007-09-18 20:14 UTC | newest]
Thread overview: 4+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
[not found] <bug-9018-10286@http.bugzilla.kernel.org/>
2007-09-14 20:56 ` [Bugme-new] [Bug 9018] New: Kernel bug in aic94xx driver shipped with kernel 2.6.21.7 Andrew Morton
2007-09-14 21:13 ` Jeff Garzik
2007-09-14 21:22 ` James Bottomley
2007-09-18 20:14 ` Martijn Prummel
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox