Re: 2.5.73-mm1 falling over in SDET

public inbox for linux-scsi@vger.kernel.org
 help / color / mirror / Atom feed

* Re: 2.5.73-mm1 falling over in SDET
       [not found] ` <49400000.1056814561@[10.10.2.4]>
@ 2003-06-29  0:02   ` Andrew Morton
  2003-06-29  3:28     ` James Bottomley
  0 siblings, 1 reply; 4+ messages in thread
From: Andrew Morton @ 2003-06-29  0:02 UTC (permalink / raw)
  To: Martin J. Bligh; +Cc: linux-kernel, linux-scsi

"Martin J. Bligh" <mbligh@aracnet.com> wrote:
>
> > The killer SDET has got you, but this is all I got from the chewed
> > remains. Maybe the EIP is enough? ;-) I guess that's a NULL ptr
> > dereference, though garbled somewhat.
> 
> Repeated it and got a much better panic. This is with feral isp driver
> + Mike's patch, BTW. Maybe it's just some stack overflow?

Oh lovely.

> Call Trace:
>  [<c01c42d3>] blk_insert_request+0x47/0x78
>  [<c01d3679>] scsi_queue_insert+0x71/0x7c
>  [<c01d05a4>] scsi_dispatch_cmd+0x180/0x190
>  [<c01d465c>] scsi_request_fn+0x1f4/0x264
>  [<c01c42f1>] blk_insert_request+0x65/0x78
>  [<c01d3679>] scsi_queue_insert+0x71/0x7c
>  [<c01d05a4>] scsi_dispatch_cmd+0x180/0x190
>  [<c01d465c>] scsi_request_fn+0x1f4/0x264
>  [<c01c42f1>] blk_insert_request+0x65/0x78

Yes, isplinux_queuecommand() returns non-zero and the scsi generic layer
cheerfully goes infinitely recursive.



^ permalink raw reply	[flat|nested] 4+ messages in thread

* Re: 2.5.73-mm1 falling over in SDET
  2003-06-29  0:02   ` 2.5.73-mm1 falling over in SDET Andrew Morton
@ 2003-06-29  3:28     ` James Bottomley
  2003-06-29  5:35       ` Martin J. Bligh
  0 siblings, 1 reply; 4+ messages in thread
From: James Bottomley @ 2003-06-29  3:28 UTC (permalink / raw)
  To: Andrew Morton; +Cc: Martin J. Bligh, Linux Kernel, SCSI Mailing List

[-- Attachment #1: Type: text/plain, Size: 269 bytes --]

On Sat, 2003-06-28 at 19:02, Andrew Morton wrote:
> Yes, isplinux_queuecommand() returns non-zero and the scsi generic layer
> cheerfully goes infinitely recursive.

Sigh, certain persons need to be more careful when doing logic
alterations.

Try the attached.

James


[-- Attachment #2: tmp.diff --]
[-- Type: text/plain, Size: 448 bytes --]

===== drivers/scsi/hosts.c 1.79 vs edited =====
--- 1.79/drivers/scsi/hosts.c	Thu Jun 26 22:51:24 2003
+++ edited/drivers/scsi/hosts.c	Sat Jun 28 22:14:12 2003
@@ -194,7 +194,7 @@
 	shost->use_blk_tcq = sht->use_blk_tcq;
 	shost->highmem_io = sht->highmem_io;
 
-	if (!sht->max_host_blocked)
+	if (sht->max_host_blocked)
 		shost->max_host_blocked = sht->max_host_blocked;
 	else
 		shost->max_host_blocked = SCSI_DEFAULT_HOST_BLOCKED;

^ permalink raw reply	[flat|nested] 4+ messages in thread

* Re: 2.5.73-mm1 falling over in SDET
  2003-06-29  3:28     ` James Bottomley
@ 2003-06-29  5:35       ` Martin J. Bligh
  2003-06-30 10:17         ` Maneesh Soni
  0 siblings, 1 reply; 4+ messages in thread
From: Martin J. Bligh @ 2003-06-29  5:35 UTC (permalink / raw)
  To: James Bottomley, Andrew Morton; +Cc: Linux Kernel, SCSI Mailing List

--James Bottomley <James.Bottomley@SteelEye.com> wrote (on Saturday, June 28, 2003 22:28:57 -0500):

> On Sat, 2003-06-28 at 19:02, Andrew Morton wrote:
>> Yes, isplinux_queuecommand() returns non-zero and the scsi generic layer
>> cheerfully goes infinitely recursive.
> 
> Sigh, certain persons need to be more careful when doing logic
> alterations.
> 
> Try the attached.

OK, that gets rather further, and I strongly suspect fixes the SCSI
problem. Thanks very much.

But now it just OOMs instead, which seems to be slab failing 
dismally to shrink it's fat ass enough to fit in that lazy-boy.
Ext2 doesn't look desparately happy either. Maybe it's really
that one's fault?

 EXT2-fs error (device sda2): ext2_new_inode: Free inodes count corrupted in group 30
Remounting filesystem read-only
Out of Memory: Killed process 215 (portmap).
Out of Memory: Killed process 338 (sshd).

larry:~# cat /proc/meminfo
MemTotal:     16076324 kB
MemFree:      15068968 kB
Buffers:           476 kB
Cached:         260824 kB
SwapCached:          0 kB
Active:         241972 kB
Inactive:        23196 kB
HighTotal:    15335424 kB
HighFree:     15047680 kB
LowTotal:       740900 kB
LowFree:         21288 kB
SwapTotal:           0 kB
SwapFree:            0 kB
Dirty:               0 kB
Writeback:           0 kB
Mapped:           5396 kB
Slab:           697856 kB
Committed_AS:     9688 kB
PageTables:        264 kB
VmallocTotal:   114680 kB
VmallocUsed:      3156 kB
VmallocChunk:   111524 kB

slabinfo:

dentry_cache      3999781 4011624    160   24    1 : tunables  120   60    8 : slabdata 167151 167151      0

Bad dentries. no no.


^ permalink raw reply	[flat|nested] 4+ messages in thread

* Re: 2.5.73-mm1 falling over in SDET
  2003-06-29  5:35       ` Martin J. Bligh
@ 2003-06-30 10:17         ` Maneesh Soni
  0 siblings, 0 replies; 4+ messages in thread
From: Maneesh Soni @ 2003-06-30 10:17 UTC (permalink / raw)
  To: Martin J. Bligh
  Cc: James Bottomley, Andrew Morton, Linux Kernel, SCSI Mailing List

On Sun, Jun 29, 2003 at 06:09:41AM +0000, Martin J. Bligh wrote:
> --James Bottomley <James.Bottomley@SteelEye.com> wrote (on Saturday, June 28, 2003 22:28:57 -0500):
> 
> > On Sat, 2003-06-28 at 19:02, Andrew Morton wrote:
> >> Yes, isplinux_queuecommand() returns non-zero and the scsi generic layer
> >> cheerfully goes infinitely recursive.
> > 
> > Sigh, certain persons need to be more careful when doing logic
> > alterations.
> > 
> > Try the attached.
> 
> OK, that gets rather further, and I strongly suspect fixes the SCSI
> problem. Thanks very much.
> 
> But now it just OOMs instead, which seems to be slab failing 
> dismally to shrink it's fat ass enough to fit in that lazy-boy.
> Ext2 doesn't look desparately happy either. Maybe it's really
> that one's fault?
> 

I tried sdet on 16-way numaq with 2.5.73-mm2. It completes the run on ext2 
(no OOMs), but gives following oops while running on ext3

------------[ cut here ]------------
kernel BUG at fs/jbd/transaction.c:1132!
invalid operand: 0000 [#1]
SMP 
CPU:    5
EIP:    0060:[<c019f23d>]    Not tainted VLI
EFLAGS: 00010282
EIP is at journal_dirty_metadata+0x171/0x27c
eax: 00000072   ebx: e66818e4   ecx: c03e85e0   edx: c03736d0
esi: e6681800   edi: da9e3580   ebp: e66818e4   esp: d755de84
ds: 007b   es: 007b   ss: 0068
Process cpio (pid: 32707, threadinfo=d755c000 task=d9f70ce0)
Stack: c030f040 c030f5b4 c030f007 0000046c c030f5e0 00002f40 00000069 00000000 
       d832f6e4 c6f0b4c0 c01936d0 e042ca00 e50c2e0c d76df1a4 e042ca00 d76df1a4 
       d80f62a0 e5dfc000 00008000 e5dfc000 00000000 d832f660 e5e73400 e5e72d20 
Call Trace:
 [<c01936d0>] ext3_new_inode+0x210/0x664
 [<c0199658>] ext3_create+0x40/0x8c
 [<c0168ecb>] vfs_create+0x67/0x8c
 [<c01691cd>] open_namei+0x165/0x3e0
 [<c01584ab>] filp_open+0x3b/0x5c
 [<c0158993>] sys_open+0x37/0x78
 [<c01090b3>] syscall_call+0x7/0xb

Code: 54 24 1c f0 0f ab 02 83 7f 14 00 75 29 68 e0 f5 30 c0 68 6c 04 00 00 68 07 f0 30 c0 68 b4 f5 30 c0 68 40 f0 30 c0 e8 db 14 f8 ff <0f> 0b 6c 04 07 f0 30 c0 83 c4 14 8b 47 14 3b 44 24 10 74 63 3b 

Regards,
Maneesh

-- 
Maneesh Soni
IBM Linux Technology Center, 
IBM India Software Lab, Bangalore.
Phone: +91-80-5044999 email: maneesh@in.ibm.com
http://lse.sourceforge.net/

^ permalink raw reply	[flat|nested] 4+ messages in thread

end of thread, other threads:[~2003-06-30  9:56 UTC | newest]

Thread overview: 4+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
     [not found] <45120000.1056810681@[10.10.2.4]>
     [not found] ` <49400000.1056814561@[10.10.2.4]>
2003-06-29  0:02   ` 2.5.73-mm1 falling over in SDET Andrew Morton
2003-06-29  3:28     ` James Bottomley
2003-06-29  5:35       ` Martin J. Bligh
2003-06-30 10:17         ` Maneesh Soni

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox