From: Chandra Seetharaman <sekharan@us.ibm.com>
To: device-mapper development <dm-devel@redhat.com>
Subject: Re: Multipath failover issues
Date: Mon, 16 Mar 2009 15:40:53 -0700 [thread overview]
Message-ID: <1237243253.309.13.camel@chandra-ubuntu> (raw)
In-Reply-To: <49BEBEB9.7020707@directi.com>
On Tue, 2009-03-17 at 02:33 +0530, dushyanth.h@directi.com wrote:
> Hi,
>
> >> device-mapper: multipath: Failing path 8:32.
> >
> > 8:32 has failed here.
>
> >> sd 2:0:0:0: SCSI error: return code = 0x00020000
> >
> > error code 20000 mean the BUS is busy.
> >
> >> end_request: I/O error, dev sdd, sector 1976776672
> >> device-mapper: multipath: Failing path 8:48.
> >
> > and 8:48 failed because of that.
>
> > Do you know which one was supposed to fail when the RAID controller
> > failed ? (my guess is it is 8:32).
>
> The alert on the storage device was (sorry for not including this earlier)
>
> 257 Critical 2009-03-11 10:38:43 ALERT:Redundant Controller Failure
> Detected (Slot B)
>
> I also found additional logs from /var/log/messages which i did not
> check earlier.
>
> Mar 11 10:32:46 multipathd: sdc: readsector0 checker reports path is down
> Mar 11 10:32:46 multipathd: checker failed path 8:32 in map infortrend01
> Mar 11 10:32:46 multipathd: infortrend01: remaining active paths: 1
> Mar 11 10:32:46 multipathd: sdd: readsector0 checker reports path is down
> Mar 11 10:32:46 multipathd: checker failed path 8:48 in map infortrend01
Does this timing correspond to when you turned off the controller ?
> Mar 11 10:32:46 multipathd: infortrend01: remaining active paths: 0
> Mar 11 10:32:46 multipathd: dm-0: add map (uevent)
> Mar 11 10:32:46 multipathd: dm-0: devmap already registered
> Mar 11 10:32:46 multipathd: dm-0: add map (uevent)
> Mar 11 10:32:47 multipathd: dm-0: devmap already registered
> Mar 11 10:32:47 multipathd: sdd: readsector0 checker reports path is down
>
> So, it looks like 8:32 was the path which had the failed controller and
> during the switch over multipath must have detected 8:48 as busy? if
> this is right, then it must be due to the infortrend device itself.
>
> > looks like for whatever reason the other SCSI bus became busy.
>
> >> sd 2:0:0:0: SCSI error: return code = 0x00020000
> >> end_request: I/O error, dev sdd, sector 1967432880
> >> sd 2:0:0:0: SCSI error: return code = 0x00020000
> >> end_request: I/O error, dev sdd, sector 161647296
>
> Iam assuming it must have been busy for a few secs during the switch
> over and the multipath config doesn't wait enough for the switchover to
> work.
>
Answer to your previous question would help here :)
Set no_path_retry to "queue", which would queue the I/Os when "all" the
paths fail.
If the behavior seen above was caused by the storage and will be
rectified in an acceptable (to the user) time, then this parameter
setting would solve your problem.
BTW, have you seen the I/O successfully been sent to the lun (both paths
- you can use iostat to check it) before you failed the controller ? (I
am trying to see if your config settings are proper).
> Any advice on the below values ?
>
> > Additionaly, I have looked at the mailing list archives & annotated conf
> > files and found two options a) failback and b) no_path_retry. What would
Failback would be useful only when you have different path groups. In a
multibus setup like yours it is of no value.
> > be the best recommended values for these on a dual controller setup like
> > mine ?
> >
> > It would also be helpful if someone could share infotrend specific
> > settings multipath settings.
>
> TIA
> Dushyanth
>
> --
> dm-devel mailing list
> dm-devel@redhat.com
> https://www.redhat.com/mailman/listinfo/dm-devel
next prev parent reply other threads:[~2009-03-16 22:40 UTC|newest]
Thread overview: 7+ messages / expand[flat|nested] mbox.gz Atom feed top
2009-03-16 16:21 Multipath failover issues dushyanth.h
2009-03-16 16:49 ` Bryn M. Reeves
2009-03-16 16:50 ` Bryn M. Reeves
2009-03-16 17:59 ` Chandra Seetharaman
2009-03-16 21:03 ` dushyanth.h
2009-03-16 22:40 ` Chandra Seetharaman [this message]
2009-03-17 12:30 ` Dushyanth Harinath
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=1237243253.309.13.camel@chandra-ubuntu \
--to=sekharan@us.ibm.com \
--cc=dm-devel@redhat.com \
--cc=sekharan@linux.vnet.ibm.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.