From: Mike Christie <michaelc@cs.wisc.edu>
To: device-mapper development <dm-devel@redhat.com>
Cc: open-iscsi@googlegroups.com
Subject: Re: Failing on NIC removal
Date: Mon, 19 Nov 2007 15:58:21 -0600
Message-ID: <474206FD.3050402@cs.wisc.edu>
In-Reply-To: <20071119214601.GB829@linux.vnet.ibm.com>
Mike Anderson wrote:
> cc'ing open-iscsi
>
> Scott Moseman <scmoseman@gmail.com> wrote:
>> So I finally got my multipath running through both the NIC and HBA
>> interfaces, but I'm not having any luck going through testing to
>> verify it's actually failing over between the connections.
>>
>> # multipath -l
>> mpath0 (30690a018f015191a6472441d1500f057)
>> [size=4 GB][features="0"][hwhandler="0"]
>> \_ round-robin 0 [active]
>>  \_ 3:0:0:0 sdc 8:32 [active][ready]
>> \_ round-robin 0 [enabled]
>>  \_ 1:0:1:0 sdb 8:16 [active][ready]
>>
>> I can unplug the HBA (see below) and the connection to the SAN remains.
>>
>> # multipath -l
>> mpath0 (30690a018f015191a6472441d1500f057)
>> [size=4 GB][features="0"][hwhandler="0"]
>> \_ round-robin 0 [active]
>>  \_ 3:0:0:0 sdc 8:32 [active][ready]
>> \_ round-robin 0 [enabled]
>>  \_ 1:0:1:0 sdb 8:16 [failed][faulty]
>>
>> But when I unplug the NIC connection, the multipath command hangs,
>> trying to list files on the SAN partition hangs, and I'm getting these
>> messages:
>>
>> Nov 19 17:15:13 ems1 kernel: iscsi-sfnet:host3: Connect failed with rc
>> -113: No route to host
>> Nov 19 17:15:13 ems1 kernel: iscsi-sfnet:host3: establish_session
>> failed. Could not connect to target
>> Nov 19 17:15:13 ems1 kernel: iscsi-sfnet:host3: Waiting 10 seconds
>> before next login attempt
>>
>> How to troubleshoot this situation?
>
> The IO is hanging waiting for the connection to be reestablished.
>
> You may need to set ConnFailTimeout to a non-zero value as indicated in
> http://people.redhat.com/mchristi/iscsi/RHEL4/doc/readme
>
Mike Anderson is right. If you are using multipath, you should set
ConnFailTimeout to a low value such as 3 or 5 seconds, because we want
commands to fail quickly up to the multipath layer. For dm-multipath,
you then want to set no_path_retry either to queue IO forever (that is,
until the paths come back) or to give up after some timeout.
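
As a rough sketch of the two settings described above: the values are
only illustrative, and the file locations and exact syntax are
assumptions that should be checked against the readme linked earlier
and the multipath.conf man page for your release.

```
# /etc/iscsi.conf (iscsi-sfnet initiator; path and syntax assumed)
# Fail outstanding commands about 5 seconds after the connection drops,
# instead of blocking until the session is reestablished.
ConnFailTimeout=5
```

```
# /etc/multipath.conf
defaults {
	# Option A: queue IO until a path comes back.
	no_path_retry queue

	# Option B: stop queueing after N path-checker retries, then fail IO.
	# no_path_retry 12
}
```

The numeric form counts path-checker retries rather than seconds, so
the effective timeout is roughly that count multiplied by
polling_interval.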