From: Mike Loseke <mike.tummy@gmail.com>
To: Andrew Morton <akpm@linux-foundation.org>
Cc: linux-scsi@vger.kernel.org, bugzilla-daemon@bugzilla.kernel.org,
Kay Sievers <kay.sievers@vrfy.org>, Greg KH <greg@kroah.com>
Subject: Re: [Bug 13311] New: mptsas: ioc0: removing ssp device, kernel oops
Date: Tue, 9 Jun 2009 15:27:05 -0600 [thread overview]
Message-ID: <5e83fab10906091427i23f15492rbd5c6b587cb578c8@mail.gmail.com> (raw)
In-Reply-To: <20090528010038.b4d3b2b9.akpm@linux-foundation.org>
[-- Attachment #1: Type: text/plain, Size: 1532 bytes --]
On Thu, May 28, 2009 at 2:00 AM, Andrew Morton
<akpm@linux-foundation.org> wrote:
>
> (switched to email. Please respond via emailed reply-to-all, not via the
> bugzilla web interface).
>
> On Thu, 14 May 2009 18:17:10 GMT bugzilla-daemon@bugzilla.kernel.org wrote:
>
> > http://bugzilla.kernel.org/show_bug.cgi?id=13311
> >
> > Summary: mptsas: ioc0: removing ssp device, kernel oops
>
> I'd have thought that the severity of this problem is not matched by
> the response.
>
> > Product: SCSI Drivers
> > Version: 2.5
> > Kernel Version: 2.6.27.21
>
> Is it reproducible? If so, is there any change that it can be retested
> under a 2.6.29-based kernel?
We've put a 2.6.29 kernel on these two systems and experienced another
kernel oops yesterday. So far, we haven't been able to reproduce it
on demand, but it has occurred under a heavier system load each time
(load average of 16 with 2,000 blocks/sec every 5 seconds writes to
the devices attached using the mptsas driver.
The oops from yesterday isn't identical to the previous oops, but the
end result is the same where the system has to be rebooted. I've
attached that the log capture of the oops.
The system is identical to the original specs, just the kernel has changed:
# cat /proc/version
Linux version 2.6.29.4-0.1-default (root@tile01-primary) (gcc version
4.3.2 [gcc-4_3-branch revision 141291] (SUSE Linux) ) #1 SMP Tue May
26 22:50:58 CDT 2009
Hopefully this is helpful.
Mike
[-- Attachment #2: tile01-secondary.oops --]
[-- Type: application/octet-stream, Size: 7912 bytes --]
Jun 8 17:06:10 tile01-secondary kernel: mptbase: ioc0: LogInfo(0x31140000): Originator={PL}, Code={IO Executed}, SubCode(0x0000)
Jun 8 17:06:10 tile01-secondary kernel: sd 2:0:0:0: [sda] Unhandled error code
Jun 8 17:06:10 tile01-secondary kernel: sd 2:0:0:0: [sda] Result: hostbyte=DID_NO_CONNECT driverbyte=DRIVER_OK,SUGGEST_OK
Jun 8 17:06:10 tile01-secondary kernel: end_request: I/O error, dev sda, sector 207
Jun 8 17:06:10 tile01-secondary kernel: device-mapper: multipath: Failing path 8:0.
Jun 8 17:06:10 tile01-secondary kernel: mptbase: ioc0: LogInfo(0x31140000): Originator={PL}, Code={IO Executed}, SubCode(0x0000)
Jun 8 17:06:10 tile01-secondary kernel: sd 2:0:0:0: [sda] Unhandled error code
Jun 8 17:06:10 tile01-secondary kernel: sd 2:0:0:0: [sda] Result: hostbyte=DID_NO_CONNECT driverbyte=DRIVER_OK,SUGGEST_OK
Jun 8 17:06:10 tile01-secondary kernel: end_request: I/O error, dev sda, sector 65679
Jun 8 17:06:10 tile01-secondary kernel: mptbase: ioc0: LogInfo(0x31140000): Originator={PL}, Code={IO Executed}, SubCode(0x0000)
Jun 8 17:06:10 tile01-secondary kernel: mptbase: ioc0: LogInfo(0x31140000): Originator={PL}, Code={IO Executed}, SubCode(0x0000)
Jun 8 17:06:10 tile01-secondary kernel: mptbase: ioc0: LogInfo(0x31140000): Originator={PL}, Code={IO Executed}, SubCode(0x0000)
Jun 8 17:06:10 tile01-secondary kernel: mptbase: ioc0: LogInfo(0x31140000): Originator={PL}, Code={IO Executed}, SubCode(0x0000)
Jun 8 17:06:10 tile01-secondary kernel: mptbase: ioc0: LogInfo(0x31140000): Originator={PL}, Code={IO Executed}, SubCode(0x0000)
Jun 8 17:06:11 tile01-secondary kernel: mptscsih: ioc0: attempting task abort! (sc=ffff88021e08e880)
Jun 8 17:06:11 tile01-secondary kernel: scsi 2:0:0:0: [sda] CDB: Write(10): 2a 00 00 00 f0 87 00 04 00 00
Jun 8 17:06:11 tile01-secondary kernel: mptscsih: ioc0: task abort: SUCCESS (sc=ffff88021e08e880)
Jun 8 17:06:11 tile01-secondary kernel: mptscsih: ioc0: attempting task abort! (sc=ffff880106684dc0)
Jun 8 17:06:11 tile01-secondary kernel: scsi 2:0:0:0: [sda] CDB: Write(10): 2a 00 00 00 f4 87 00 04 00 00
Jun 8 17:06:11 tile01-secondary kernel: mptscsih: ioc0: task abort: SUCCESS (sc=ffff880106684dc0)
Jun 8 17:06:11 tile01-secondary kernel: mptscsih: ioc0: attempting task abort! (sc=ffff8803b0a131c0)
Jun 8 17:06:11 tile01-secondary kernel: scsi 2:0:0:0: [sda] CDB: Write(10): 2a 00 00 00 f8 87 00 04 00 00
Jun 8 17:06:11 tile01-secondary kernel: mptscsih: ioc0: task abort: SUCCESS (sc=ffff8803b0a131c0)
Jun 8 17:06:11 tile01-secondary kernel: mptscsih: ioc0: attempting task abort! (sc=ffff8803b0a13ec0)
Jun 8 17:06:11 tile01-secondary kernel: scsi 2:0:0:0: [sda] CDB: Write(10): 2a 00 00 00 fc 87 00 00 08 00
Jun 8 17:06:11 tile01-secondary kernel: mptscsih: ioc0: task abort: SUCCESS (sc=ffff8803b0a13ec0)
Jun 8 17:06:11 tile01-secondary kernel: mptscsih: ioc0: attempting task abort! (sc=ffff8803b0a13cc0)
Jun 8 17:06:11 tile01-secondary kernel: scsi 2:0:0:0: [sda] CDB: Write(10): 2a 00 00 00 fc 8f 00 04 00 00
Jun 8 17:06:11 tile01-secondary kernel: mptscsih: ioc0: task abort: SUCCESS (sc=ffff8803b0a13cc0)
Jun 8 17:06:11 tile01-secondary kernel: mptscsih: ioc0: attempting bus reset! (sc=ffff88021e08e880)
Jun 8 17:06:11 tile01-secondary kernel: scsi 2:0:0:0: [sda] CDB: Write(10): 2a 00 00 00 f0 87 00 04 00 00
Jun 8 17:06:11 tile01-secondary kernel: BUG: unable to handle kernel NULL pointer dereference at (null)
Jun 8 17:06:11 tile01-secondary kernel: IP: [<ffffffffa008cc98>] mptscsih_bus_reset+0x97/0xfa [mptscsih]
Jun 8 17:06:11 tile01-secondary kernel: PGD 82944c067 PUD 82e4e9067 PMD 0
Jun 8 17:06:11 tile01-secondary kernel: Oops: 0000 [#1] SMP
Jun 8 17:06:11 tile01-secondary kernel: last sysfs file: /sys/kernel/uevent_seqnum
Jun 8 17:06:11 tile01-secondary kernel: CPU 1
Jun 8 17:06:11 tile01-secondary kernel: Modules linked in: reiserfs dm_round_robin ipt_REJECT nf_conntrack_ipv4 nf_defrag_ipv4 xt_state nf_conntrack xt_tcpudp iptable_filter dm_multipath scsi_dh ip_tables iscsi_trgt crc32c x_tables 8021q garp stp bonding ipv6 cpufreq_conservative cpufreq_userspace cpufreq_powersave powernow_k8 ext3 jbd mbcache loop dm_mod qla4xxx scsi_transport_iscsi qla3xxx rtc_cmos i2c_nforce2 rtc_core rtc_lib shpchp forcedeth pcspkr joydev serio_raw mptctl pci_hotplug i2c_core button sr_mod sg cdrom usbhid hid ohci_hcd ehci_hcd sd_mod crc_t10dif usbcore edd xfs exportfs fan 3w_9xxx ide_pci_generic amd74xx ide_core ata_generic thermal processor thermal_sys hwmon sata_nv mptsas mptscsih mptbase scsi_transport_sas pata_amd libata scsi_mod
Jun 8 17:06:11 tile01-secondary kernel: Pid: 175, comm: scsi_eh_2 Not tainted 2.6.29.4-0.1-default #1 H8DM3-2
Jun 8 17:06:11 tile01-secondary kernel: RIP: 0010:[<ffffffffa008cc98>] [<ffffffffa008cc98>] mptscsih_bus_reset+0x97/0xfa [mptscsih]
Jun 8 17:06:11 tile01-secondary kernel: RSP: 0018:ffff88083354ddb0 EFLAGS: 00010203
Jun 8 17:06:11 tile01-secondary kernel: RAX: ffff8804359cb002 RBX: ffff88043368a560 RCX: ffff88021e08e880
Jun 8 17:06:11 tile01-secondary kernel: RDX: 0000000000000000 RSI: 0000000000000004 RDI: ffff88043368a560
Jun 8 17:06:11 tile01-secondary kernel: RBP: ffff88083354dde0 R08: 0000000000000002 R09: 0000000000000000
Jun 8 17:06:11 tile01-secondary kernel: R10: ffffffff80d7e600 R11: 0000000000000010 R12: ffff88021e08e880
Jun 8 17:06:11 tile01-secondary kernel: R13: ffff8804335a3000 R14: ffff8804335a3008 R15: ffff88083354dee0
Jun 8 17:06:11 tile01-secondary kernel: FS: 00007f66c7122740(0000) GS:ffff88043596edc0(0000) knlGS:0000000000000000
Jun 8 17:06:11 tile01-secondary kernel: CS: 0010 DS: 0018 ES: 0018 CR0: 000000008005003b
Jun 8 17:06:11 tile01-secondary kernel: CR2: 0000000000000000 CR3: 000000082d955000 CR4: 00000000000006e0
Jun 8 17:06:11 tile01-secondary kernel: DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
Jun 8 17:06:11 tile01-secondary kernel: DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
Jun 8 17:06:11 tile01-secondary kernel: Process scsi_eh_2 (pid: 175, threadinfo ffff88083354c000, task ffff8808331082c0)
Jun 8 17:06:11 tile01-secondary kernel: Stack:
Jun 8 17:06:11 tile01-secondary kernel: ffff8804337b4810 0000000000000000 ffff88021e08e880 0000000000002003
Jun 8 17:06:11 tile01-secondary kernel: ffff8804359cb000 0000000000000000 ffff88083354de00 ffffffffa00034ee
Jun 8 17:06:11 tile01-secondary kernel: ffff88021e08e880 0000000000000000 ffff88083354de60 ffffffffa000441f
Jun 8 17:06:11 tile01-secondary kernel: Call Trace:
Jun 8 17:06:11 tile01-secondary kernel: [<ffffffffa00034ee>] scsi_try_bus_reset+0x52/0xde [scsi_mod]
Jun 8 17:06:11 tile01-secondary kernel: [<ffffffffa000441f>] scsi_eh_ready_devs+0x4c3/0x737 [scsi_mod]
Jun 8 17:06:11 tile01-secondary kernel: [<ffffffffa0004bfe>] scsi_error_handler+0x37d/0x51b [scsi_mod]
Jun 8 17:06:11 tile01-secondary kernel: [<ffffffff8022f2ea>] ? __wake_up_common+0x46/0x76
Jun 8 17:06:11 tile01-secondary kernel: [<ffffffffa0004881>] ? scsi_error_handler+0x0/0x51b [scsi_mod]
Jun 8 17:06:11 tile01-secondary kernel: [<ffffffff80251952>] kthread+0x49/0x76
Jun 8 17:06:11 tile01-secondary kernel: [<ffffffff8020d03a>] child_rip+0xa/0x20
Jun 8 17:06:11 tile01-secondary kernel: [<ffffffff80251909>] ? kthread+0x0/0x76
Jun 8 17:06:11 tile01-secondary kernel: [<ffffffff8020d030>] ? child_rip+0x0/0x20
Jun 8 17:06:11 tile01-secondary kernel: Code: 00 48 83 f8 ff 74 0a 48 ff c0 48 89 83 b0 00 00 00 49 8b 04 24 48 89 df be 04 00 00 00 48 8b 90 88 00 00 00 41 8a 85 98 00 00 00 <48> 8b 12 3c 01 19 c0 45 31 c9 45 31 c0 83 e0 1e 31 c9 0f b6 52
Jun 8 17:06:11 tile01-secondary kernel: RIP [<ffffffffa008cc98>] mptscsih_bus_reset+0x97/0xfa [mptscsih]
Jun 8 17:06:11 tile01-secondary kernel: RSP <ffff88083354ddb0>
Jun 8 17:06:11 tile01-secondary kernel: CR2: 0000000000000000
Jun 8 17:06:11 tile01-secondary kernel: ---[ end trace 54f83dcc0f7b0b26 ]---
next prev parent reply other threads:[~2009-06-09 21:27 UTC|newest]
Thread overview: 11+ messages / expand[flat|nested] mbox.gz Atom feed top
2009-05-14 18:17 [Bug 13311] New: mptsas: ioc0: removing ssp device, kernel oops bugzilla-daemon
2009-05-28 8:00 ` Andrew Morton
2009-05-28 11:54 ` Kay Sievers
2009-06-09 21:27 ` Mike Loseke [this message]
2009-06-09 21:52 ` Andrew Morton
2009-05-28 8:01 ` [Bug 13311] " bugzilla-daemon
2009-05-28 11:54 ` bugzilla-daemon
2009-06-09 21:27 ` bugzilla-daemon
2009-06-09 21:30 ` bugzilla-daemon
2009-06-09 21:53 ` bugzilla-daemon
2009-07-16 21:41 ` bugzilla-daemon
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=5e83fab10906091427i23f15492rbd5c6b587cb578c8@mail.gmail.com \
--to=mike.tummy@gmail.com \
--cc=akpm@linux-foundation.org \
--cc=bugzilla-daemon@bugzilla.kernel.org \
--cc=greg@kroah.com \
--cc=kay.sievers@vrfy.org \
--cc=linux-scsi@vger.kernel.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).