From: Vu Pham <vuhuong-VPRAkNaXOzVWk0Htik3J/w@public.gmane.org>
To: Linux RDMA list <linux-rdma-u79uwXL29TY76Z2rM5mHXA@public.gmane.org>
Subject: [ofa-general][PATCH 0/4] SRP fail-over faster
Date: Mon, 12 Oct 2009 15:56:33 -0700 [thread overview]
Message-ID: <4AD3B421.1050704@mellanox.com> (raw)
[-- Attachment #1: Type: text/plain, Size: 1 bytes --]
[-- Attachment #2: [ofa-general][PATCH 0/4] SRP fail-over faster.eml --]
[-- Type: message/rfc822, Size: 1910 bytes --]
From: Vu Pham <vuhuong-VPRAkNaXOzVWk0Htik3J/w@public.gmane.org>
To: OF General <general-ZwoEplunGu1OwGhvXhtEPSCwEArCW2h5@public.gmane.org>, Roland Dreier <rdreier-FYB4Gu1CFyUAvxtiuMwx3w@public.gmane.org>
Subject: [ofa-general][PATCH 0/4] SRP fail-over faster
Date: Mon, 12 Oct 2009 15:46:30 -0700
Message-ID: <4AD3B1C6.9050400-VPRAkNaXOzVWk0Htik3J/w@public.gmane.org>
Testing srp fail-over with dm-multipath/multipathd/srp_daemon, the
current srp implementation will take average 3-5 minutes to complete the
error recovery before return DID_BAD_TARGET so that dm-multipath can
switch to other paths. During this error recovery, there is no I/O
happening (old and new I/Os)
The following patches attempt to help srp fail-over faster and
controllable. It introduces srp_dev_loss_tmo module parameter, so that,
srp will fail-over after srp_dev_loss_tmo expired.The minimum value for
srp_dev_loss_tmo is 60 seconds.
Patch 1/4: recreate qp and cq at reconnect instead of reuse them
Patch 2/4: disconnect request without wait.
Patch 3/4: introducing srp_dev_loss_tmo, creating a timer on qp_error event.
Patch 4/4: setting up async event handler to handle local port up/down
events
The fail-over will be more accurate on local port up/down events (ie.
someone pull the cable connect local port to switch), it is less
accurate on target port up/down events (ie. someone pull the cable
connect target port to switch)
To be accurate on target port up/down events, it requires to change
srp_daemon to catch the event of IB target port joining/leaving the
fabric, then pass these event down to srp driver, srp driver need to
implement entry points to receive these events and act upon them. These
are missed on this attempt
thanks,
-vu
reply other threads:[~2009-10-12 22:56 UTC|newest]
Thread overview: [no followups] expand[flat|nested] mbox.gz Atom feed
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=4AD3B421.1050704@mellanox.com \
--to=vuhuong-vpraknaxozvwk0htik3j/w@public.gmane.org \
--cc=linux-rdma-u79uwXL29TY76Z2rM5mHXA@public.gmane.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox