From: Vu Pham <vu-VPRAkNaXOzVWk0Htik3J/w@public.gmane.org>
To: Bart Van Assche <bvanassche-HInyCGIudOg@public.gmane.org>
Cc: Or Gerlitz <or.gerlitz-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org>,
David Dillow <dave-i1Mk8JYDVaaSihdK6806/g@public.gmane.org>,
Roland Dreier <roland-DgEjT+Ai2ygdnm+yROfE0A@public.gmane.org>,
"linux-rdma-u79uwXL29TY76Z2rM5mHXA@public.gmane.org"
<linux-rdma-u79uwXL29TY76Z2rM5mHXA@public.gmane.org>,
Or Gerlitz <ogerlitz-VPRAkNaXOzVWk0Htik3J/w@public.gmane.org>,
Oren Duer <oren-VPRAkNaXOzVWk0Htik3J/w@public.gmane.org>,
Sagi Grimberg <sagig-VPRAkNaXOzVWk0Htik3J/w@public.gmane.org>
Subject: Re: [PATCH for 3.8 v3, resend 0/3] IB/SRP patches for kernel 3.8
Date: Wed, 6 Feb 2013 13:42:56 -0800
Message-ID: <5112CE60.2030607@mellanox.com>
In-Reply-To: <5112049B.8030406-HInyCGIudOg@public.gmane.org>
Bart Van Assche wrote:
> On 02/05/13 21:54, Or Gerlitz wrote:
>> On Tue, Feb 5, 2013 at 6:25 PM, Bart Van Assche <bvanassche-HInyCGIudOg@public.gmane.org>
>> wrote:
>>> On 02/04/13 22:11, Or Gerlitz wrote:
>> Bart, I'd like to sharpen the point: could you please clarify if the
>> series posted to linux-rdma stands for itself in the sense that SRP HA
>> scheme X (please state it) now works/better when the patches applied
>> on top of the latest 3.8-rc cut? OR for X to do better/work, one needs
>> this series AND the one you posted to linux-scsi.
>
> Hello Or,
>
> A huge number of patches have been taken upstream between 3.8-rc1 and
> 3.8-rc6. I have retested these three patches with 3.8-rc6 and would
> appreciate if you would also repeat your tests.
>
> Thanks,
>
> Bart.
Hello Bart,
I tested your 3.8 v3 patchset as follows:
- cloned Roland's ib tree and checked out the for-next branch
- applied Bart's 3.8 v3 patchset
- applied the "save & restore host_scribble during error handling" patch -
http://www.mail-archive.com/linux-scsi-u79uwXL29TY76Z2rM5mHXA@public.gmane.org/msg17809.html
I have two paths to the target, through ports 1 and 2 (scsi_host host9
and host10).
- ran I/Os
- disabled port 1 @ 19:11:30
- error recovery for host9 kicked in @ 19:12:04
- multipath removed the path, I/Os failed over @ 19:12:51
- error recovery was still going on for host9 (the sysfs entry for host9
was still intact)
- enabled port 1 @ 19:15:00
- host9 reconnected to the target through error recovery and multipathd
reinstated the path in the kernel; then host9 was REMOVED, and usermode
"multipath -l" did not show a reinstated path through host9
Feb 6 19:15:04 vsa30 kernel: scsi host9: SRP abort called
Feb 6 19:15:05 vsa30 multipathd: overflow in attribute '/sys/devices/pci0000:00/0000:00:02.0/0000:02:00.0/host9/target9:0:0/9:0:0:2/state'
Feb 6 19:15:14 vsa30 kernel: scsi host9: SRP abort called
Feb 6 19:15:14 vsa30 kernel: scsi host9: SRP reset_device called
Feb 6 19:15:14 vsa30 kernel: scsi host9: ib_srp: SRP reset_host called
Feb 6 19:15:14 vsa30 kernel: scsi host9: ib_srp: reconnect succeeded
Feb 6 19:15:26 vsa30 multipathd: 3600144f0665c4400000050a522180003: sdd - tur checker reports path is up
Feb 6 19:15:26 vsa30 multipathd: 8:48: reinstated
Feb 6 19:15:26 vsa30 multipathd: 3600144f0665c4400000050a522180003: remaining active paths: 2
Feb 6 19:15:26 vsa30 multipathd: 3600144f0665c4400000050a522180002: sdc - tur checker reports path is up
Feb 6 19:15:26 vsa30 multipathd: 8:32: reinstated
Feb 6 19:15:26 vsa30 multipathd: 3600144f0665c4400000050a522180002: remaining active paths: 2
Feb 6 19:15:26 vsa30 multipathd: sdc: remove path (uevent)
Feb 6 19:15:26 vsa30 multipathd: 3600144f0665c4400000050a522180002: load table [0 409600 multipath 0 0 1 1 round-robin 0 1 1 8:80 1]
Feb 6 19:15:26 vsa30 multipathd: sdc: path removed from map 3600144f0665c4400000050a522180002
Feb 6 19:15:26 vsa30 kernel: sd 9:0:0:1: [sdc] Synchronizing SCSI cache
Feb 6 19:15:26 vsa30 multipathd: sdd: remove path (uevent)
Feb 6 19:15:26 vsa30 multipathd: 3600144f0665c4400000050a522180003: load table [0 409600 multipath 0 0 1 1 round-robin 0 1 1 8:96 1]
Feb 6 19:15:26 vsa30 multipathd: sdd: path removed from map 3600144f0665c4400000050a522180003
Feb 6 19:15:26 vsa30 kernel: sd 9:0:0:2: [sdd] Synchronizing SCSI cache
- disabled port 2 @ 19:22:50
- error recovery kicked in on host10 @ 19:23:40
- I/Os failed with NO path to the target @ 19:24:27
- with port 2 left disabled, error recovery kept going on host10 until
19:57:52 and then stopped
- host10 was still in sysfs (/sys/class/scsi_host/host10) and still
holding a reference on the ib_srp module
- enabled port 2 - nothing happened.
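To make the intervals in the steps above easier to compare, here is a small sketch that turns the timestamps into offsets from the moment each port was disabled. Only the timestamps come from the test run; the helper names (elapsed, offsets) and the event labels are made up for illustration.

```python
from datetime import datetime

def elapsed(start: str, end: str) -> int:
    """Seconds between two same-day HH:MM:SS timestamps."""
    fmt = "%H:%M:%S"
    delta = datetime.strptime(end, fmt) - datetime.strptime(start, fmt)
    return int(delta.total_seconds())

# Timestamps copied from the test steps; labels are paraphrased.
events = {
    "host9": [
        ("port 1 disabled", "19:11:30"),
        ("error recovery starts", "19:12:04"),
        ("I/O fails over", "19:12:51"),
        ("port 1 enabled", "19:15:00"),
    ],
    "host10": [
        ("port 2 disabled", "19:22:50"),
        ("error recovery starts", "19:23:40"),
        ("I/O fails, no path", "19:24:27"),
        ("error recovery stops", "19:57:52"),
    ],
}

def offsets(steps):
    """Offset in seconds of every step from the first step in the list."""
    t0 = steps[0][1]
    return [(label, elapsed(t0, ts)) for label, ts in steps]

for host, steps in events.items():
    for label, secs in offsets(steps):
        print(f"{host}: {label:<22} +{secs:>4d} s")
```

For host10 the last offset is 2102 s, i.e. error recovery gave up about 35 minutes after port 2 was disabled.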
Conclusion:
1. If the port/path stays disabled long enough (>35 minutes), we are
left with a dangling scsi host.
2. If the port is enabled within 30 minutes, the scsi host re-establishes
the connection and the path is reinstated, but then the scsi_host is
removed (no entry in sysfs).
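Both conclusions can be checked from sysfs after each step. The sketch below lists the registered SCSI hosts and reads the ib_srp reference count. It assumes the kernel exposes /sys/module/ib_srp/refcnt (which needs module unloading support); the function names and the sysfs_root parameter are hypothetical and exist only so the code can be pointed at a test directory.

```python
from pathlib import Path

def scsi_hosts(sysfs_root="/sys"):
    """Names of the SCSI hosts currently registered in sysfs."""
    host_dir = Path(sysfs_root) / "class" / "scsi_host"
    if not host_dir.is_dir():
        return []
    return sorted(p.name for p in host_dir.iterdir())

def module_refcnt(module="ib_srp", sysfs_root="/sys"):
    """Reference count of a loaded module, or None if it cannot be read."""
    refcnt = Path(sysfs_root) / "module" / module / "refcnt"
    try:
        return int(refcnt.read_text().strip())
    except (OSError, ValueError):
        # Module not loaded, or refcnt not exposed by this kernel.
        return None

if __name__ == "__main__":
    print("scsi hosts:   ", scsi_hosts())
    print("ib_srp refcnt:", module_refcnt())
```

In case 1, host10 would still be listed and the refcnt would stay non-zero; in case 2, host9 would be missing from the list even though the port is back up.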
I have attached a log that shows what happened above.
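For anyone who wants to pull only the ib_srp error-handling messages for one host out of the attached log, a minimal sketch follows. It assumes the attachment is a bzip2-compressed syslog file in the same format as the lines quoted above; the regular expression and the function names are illustrative only.

```python
import bz2
import re

# Matches kernel lines such as:
#   Feb 6 19:15:14 vsa30 kernel: scsi host9: ib_srp: reconnect succeeded
KERNEL_SCSI = re.compile(
    r"^(\w{3}\s+\d+ \d\d:\d\d:\d\d) \S+ kernel: scsi (host\d+): (.*)$"
)

def parse_line(line):
    """Return (timestamp, host, message) for a 'scsi hostN:' kernel line, else None."""
    m = KERNEL_SCSI.match(line.rstrip("\n"))
    return m.groups() if m else None

def srp_events(path, host):
    """All (timestamp, message) pairs logged for the given host in a .bz2 log."""
    events = []
    with bz2.open(path, "rt", errors="replace") as log:
        for line in log:
            parsed = parse_line(line)
            if parsed and parsed[1] == host:
                events.append((parsed[0], parsed[2]))
    return events
```

Calling srp_events("messages.bz2", "host9") would then return the abort, reset_device, reset_host and reconnect messages for host9 in order.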
thanks,
-vu
[-- Attachment #2: messages.bz2 --]
[-- Type: application/octet-stream, Size: 10661 bytes --]