From: bugzilla-daemon@bugzilla.kernel.org
To: linux-scsi@vger.kernel.org
Subject: [Bug 14831] mptsas - Use of ATA command pass-through results in unreliable operation - drive / controller resets
Date: Mon, 30 Aug 2010 15:18:34 GMT
Message-ID: <201008301518.o7UFIYj2029110@demeter1.kernel.org>
In-Reply-To: <bug-14831-11613@https.bugzilla.kernel.org/>

https://bugzilla.kernel.org/show_bug.cgi?id=14831





--- Comment #41 from kdesai <kashyap.desai@lsi.com>  2010-08-30 15:18:21 ---
(In reply to comment #40)
> descriptions for attachments in #38 and #39 are reversed

I have taken a close look at all the available logs for the configuration
below.

kernel 5.5 2.6.18-194.8.1.el5
MPT2BIOS-7.05.01.00 (2010.02.09)
SAS2008-IT 5.00.00.00
LSI driver mpt2sas-05.00.00.00


Things are different in this case. It is not the same "smartd"-related issue
discussed in this bugzilla.

I see some kind of hotplug activity in this case (or maybe a connection issue
that created a hotplug-like situation).

1. See the snippet below from https://bugzilla.kernel.org/attachment.cgi?id=28191
--
Aug 27 14:23:34 X kernel: mpt2sas0: Device Status Change
Aug 27 14:23:34 X kernel:     handle(0x000f), sas
address(0x4433221107000000)<6>mpt2sas0: SAS Topology Change List
Aug 27 14:23:34 X kernel: sd 0:0:7:0: device_blocked, handle(0x000f)
Aug 27 14:24:02 X kernel: mpt2sas0: attempting task abort!
scmd(ffff81005a235cc0)
Aug 27 14:24:02 X kernel: sd 0:0:7:0: 
Aug 27 14:24:02 X kernel:         comma

---

The driver has received the hotplug action "device delay removal" (this is
relevant to the LSI controller's device missing delay parameters).
Check "/sys/class/scsi_host/host6/device_delay".
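
For anyone who wants to collect that value from a script, here is a minimal
sketch in Python. The helper name read_host_attr and the sysfs_root parameter
are only illustrative, and the host number is an assumption: use the hostN
that belongs to the mpt2sas controller on the affected system. The sketch
returns the raw text of the attribute and does not assume any particular
value format.

```python
from pathlib import Path


def read_host_attr(host, attr, sysfs_root="/sys/class/scsi_host"):
    """Return the stripped contents of a scsi_host sysfs attribute.

    Returns None when the attribute does not exist or cannot be read,
    for example on a machine without this controller.
    """
    path = Path(sysfs_root) / host / attr
    try:
        return path.read_text().strip()
    except OSError:
        return None


if __name__ == "__main__":
    # "host6" is taken from the path quoted above; adjust it as needed.
    value = read_host_attr("host6", "device_delay")
    if value is None:
        print("device_delay is not readable on this host")
    else:
        print("device_delay =", value)
```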

2. Very soon afterwards I see a task abort followed by a device delete event.
See the snippet below.

--
Aug 27 14:24:02 X kernel: mpt2sas0: attempting task abort!
scmd(ffff81005a235cc0)
Aug 27 14:24:02 X kernel: sd 0:0:7:0: 
Aug 27 14:24:02 X kernel:         command: Write(10): 2a 00 11 51 68 0f 00 04
00 00
Aug 27 14:24:02 X kernel: mpt2sas0: Device Status Change
Aug 27 14:24:02 X kernel: mpt2sas0: task abort: SUCCESS scmd(ffff81005a235cc0)
Aug 27 14:24:02 X kernel: 
Aug 27 14:24:02 X kernel: mpt2sas0: updating handles for
sas_host(0x5003048573212988)
Aug 27 14:24:02 X kernel:     handle(0x000f), sas
address(0x4433221107000000)<6>
Aug 27 14:24:02 X kernel: mpt2sas0: Discovery: (stop)
Aug 27 14:24:02 X kernel: mpt2sas0: Discovery: (start)
Aug 27 14:24:02 X kernel: mpt2sas0: SAS Topology Change List
Aug 27 14:24:02 X kernel: mpt2sas0: tr_send:handle(0x000f), (open), smid(439),
cb(7)
Aug 27 14:24:02 X kernel: mpt2sas0: Discovery: (stop)
Aug 27 14:24:02 X kernel: mpt2sas0: updating handles for
sas_host(0x5003048573212988)
Aug 27 14:24:02 X kernel: mpt2sas0: tr_complete:handle(0x000f), (open)
smid(439), ioc_status(0x0000), loginfo(0x00000000), completed(0)
Aug 27 14:24:02 X kernel: mpt2sas0: sc_send:handle(0x000f), (open), smid(540),
cb(5)
Aug 27 14:24:02 X kernel: mpt2sas0: sc_complete:handle(0x000f), (open)
smid(540), ioc_status(0x0000), loginfo(0x00000000)
Aug 27 14:24:02 X kernel: mpt2sas0: _scsih_remove_device: enter:
handle(0x000f), sas_addr(0x4433221107000000)
Aug 27 14:24:02 X kernel: sd 0:0:7:0: device_unblocked, handle(0x000f)
Aug 27 14:24:02 X kernel: mpt2sas0: removing handle(0x000f),
sas_addr(0x4433221107000000)
Aug 27 14:24:02 X kernel: mpt2sas0: _scsih_remove_device: exit: handle(0x000f),
sas_addr(0x4433221107000000)


---
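
The ordering in steps 1 and 2 can be checked mechanically. Below is a rough
sketch in Python that pulls the block / abort / remove events out of syslog
lines and measures the time between two of them. The function names, the
event labels and the SAMPLE list are illustrative only; SAMPLE repeats lines
from the snippets above with the mail line wrapping joined back together.

```python
import re

# Matches classic syslog lines such as
# "Aug 27 14:23:34 X kernel: sd 0:0:7:0: device_blocked, handle(0x000f)".
LINE_RE = re.compile(
    r"^\w{3}\s+\d+\s+(\d\d):(\d\d):(\d\d)\s+\S+\s+kernel:\s+(.*)$"
)

# (substring to look for, short label); the first match wins.
EVENTS = [
    ("device_blocked", "blocked"),
    ("attempting task abort", "abort_start"),
    ("task abort: SUCCESS", "abort_ok"),
    ("_scsih_remove_device: enter", "remove_enter"),
    ("device_unblocked", "unblocked"),
    ("_scsih_remove_device: exit", "remove_exit"),
]


def timeline(lines):
    """Return (seconds_since_midnight, label) for every recognised line."""
    events = []
    for line in lines:
        m = LINE_RE.match(line)
        if m is None:
            continue
        hh, mm, ss, msg = m.groups()
        seconds = int(hh) * 3600 + int(mm) * 60 + int(ss)
        for needle, label in EVENTS:
            if needle in msg:
                events.append((seconds, label))
                break
    return events


def gap(events, first, second):
    """Seconds between the first occurrences of two labels, or None."""
    times = {}
    for seconds, label in events:
        times.setdefault(label, seconds)
    if first not in times or second not in times:
        return None
    return times[second] - times[first]


SAMPLE = [
    "Aug 27 14:23:34 X kernel: sd 0:0:7:0: device_blocked, handle(0x000f)",
    "Aug 27 14:24:02 X kernel: mpt2sas0: attempting task abort! scmd(ffff81005a235cc0)",
    "Aug 27 14:24:02 X kernel: mpt2sas0: task abort: SUCCESS scmd(ffff81005a235cc0)",
    "Aug 27 14:24:02 X kernel: mpt2sas0: _scsih_remove_device: enter: handle(0x000f), sas_addr(0x4433221107000000)",
    "Aug 27 14:24:02 X kernel: sd 0:0:7:0: device_unblocked, handle(0x000f)",
    "Aug 27 14:24:02 X kernel: mpt2sas0: _scsih_remove_device: exit: handle(0x000f), sas_addr(0x4433221107000000)",
]

if __name__ == "__main__":
    events = timeline(SAMPLE)
    for seconds, label in events:
        print(seconds, label)
    print("blocked -> abort_start:", gap(events, "blocked", "abort_start"), "s")
```

On these sample lines the device stays blocked for 28 seconds before the
task abort starts. The sketch ignores the date and assumes all lines fall on
the same day.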

3. The driver then immediately receives a device add event (see the snippet
below).
--
Aug 27 14:24:02 X kernel: mpt2sas0: Discovery: (stop)
Aug 27 14:24:02 X kernel: mpt2sas0: REPORT_LUNS: handle(0x000f), retries(0)
Aug 27 14:24:02 X kernel: mpt2sas0:     ioc_status(0x0045),
loginfo(0x00000000), rc(ready)
Aug 27 14:24:02 X kernel: mpt2sas0: TEST_UNIT_READY: handle(0x000f), lun(0)
Aug 27 14:24:02 X kernel: mpt2sas0:     ioc_status(0x0000),
loginfo(0x00000000), rc(retry_ua)
Aug 27 14:24:02 X kernel: mpt2sas0:     [sense_key,asc,ascq]: [0x06,0x29,0x00]
Aug 27 14:24:02 X kernel: mpt2sas0: TEST_UNIT_READY: handle(0x000f), lun(0)
Aug 27 14:24:02 X kernel: mpt2sas0: attempting task abort!
scmd(ffff81005a235cc0)
Aug 27 14:24:02 X kernel: scsi 0:0:7:0: 
Aug 27 14:24:02 X kernel:         command: Test Unit Ready: 00 00 00 00 00 00
Aug 27 14:24:02 X kernel: mpt2sas0: device been deleted! scmd(ffff81005a235cc0)
--
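
The sense data in the snippet above is worth decoding: sense key 0x06 is UNIT
ATTENTION, and ASC/ASCQ 0x29/0x00 is "POWER ON, RESET, OR BUS DEVICE RESET
OCCURRED", which fits a device that has just been reset or re-added. A small
sketch in Python that does this lookup from a driver log line follows. The
tables are deliberately partial and the name decode_sense is illustrative,
not something the driver provides.

```python
import re

# Partial table of SCSI sense keys.
SENSE_KEYS = {
    0x00: "NO SENSE",
    0x01: "RECOVERED ERROR",
    0x02: "NOT READY",
    0x03: "MEDIUM ERROR",
    0x04: "HARDWARE ERROR",
    0x05: "ILLEGAL REQUEST",
    0x06: "UNIT ATTENTION",
    0x0B: "ABORTED COMMAND",
}

# Only the additional sense code pair seen in the log above.
ASC_ASCQ = {
    (0x29, 0x00): "POWER ON, RESET, OR BUS DEVICE RESET OCCURRED",
}

# Finds a bracketed triple of two-digit hex values, e.g. [0x06,0x29,0x00].
TRIPLE_RE = re.compile(
    r"\[(0x[0-9a-fA-F]{2}),(0x[0-9a-fA-F]{2}),(0x[0-9a-fA-F]{2})\]"
)


def decode_sense(log_line):
    """Return (sense key text, ASC/ASCQ text), or None if no triple is found."""
    m = TRIPLE_RE.search(log_line)
    if m is None:
        return None
    key, asc, ascq = (int(x, 16) for x in m.groups())
    return (
        SENSE_KEYS.get(key, "unknown sense key"),
        ASC_ASCQ.get((asc, ascq), "unknown ASC/ASCQ"),
    )


if __name__ == "__main__":
    line = "mpt2sas0:     [sense_key,asc,ascq]: [0x06,0x29,0x00]"
    print(decode_sense(line))
```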

4. At the end, an HBA reset is executed, which removes device "scsi 0:0:7:0".
This means the device is not actually present in the firmware table (this can
be confirmed if we have the output of lsiutil options 8 and 16).

In summary, this may be a completely different issue. Can we move it to a
new bugzilla, so that I can take a fresh look at it?

Thanks, Kashyap

