From: Mike Anderson <andmike@us.ibm.com>
To: device-mapper development <dm-devel@redhat.com>
Subject: Re: what is the current utility in testing active paths from multipathd?
Date: Wed, 27 Apr 2005 11:17:02 -0700
Message-ID: <20050427181701.GB9368@us.ibm.com>
In-Reply-To: <20050427170710.GU4431@marowsky-bree.de>

Lars Marowsky-Bree [lmb@suse.de] wrote:
> On 2005-04-27T12:27:32, "goggin, edward" <egoggin@emc.com> wrote:
> > If not, it may be possible to significantly reduce the cpu&io
> > resource utilization consumed by multipathd path testing on
> > enterprise scale configurations by only testing those paths
> > which the kernel thinks are in a failed state -- obviously a
> > much smaller set of paths.
> 
> I could see not testing paths if we knew IO was hitting them; as an
> approximation, the active paths from the active PG might be omitted.
> However, the paths in the inactive PG all need to be tested, or else you
> are never going to find out that the paths have gone bad on you until
> you try to failover.
> 
> The best way to minimize path (re-)testing needed is to figure in the
> hierarchy of components involved; as long as the FC switch is still bad,
> there's no point testing any target which we could reach through it,
> etc; testing whether the switch itself is healthy would round-robin
> through our various connections to the switch, to make sure we don't
> declare the switch down because we got hung up on one failed path.
> 

Once support is completed and in use, the fc_transport class should
provide data on the link state and the port state, which could give an
indication of path health when deciding whether to send a path check cmd.
This would add complexity to the tester, as each new transport would need
some type of handler.
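
As a rough illustration of the per-transport handler idea, here is a minimal
sketch in Python. Everything in it is an assumption made for illustration:
the handler registry, the function names, and the "link_state" / "port_state"
keys and their values are hypothetical and are not taken from fc_transport or
multipathd. The only point is that a transport without a handler falls back
to always being tested.

```python
from typing import Callable, Dict

# A handler looks at transport-specific state and returns True if a path
# check command is still worth sending, False if the transport already
# reports the path as unreachable. (Hypothetical interface.)
Handler = Callable[[dict], bool]


def fc_handler(state: dict) -> bool:
    # Hypothetical keys and values; skip the check only when the transport
    # positively reports the link or the port as down.
    link_down = state.get("link_state") == "down"
    port_down = state.get("port_state") == "offline"
    return not (link_down or port_down)


# One entry per transport that has a handler.
HANDLERS: Dict[str, Handler] = {"fc": fc_handler}


def should_send_path_check(transport: str, state: dict) -> bool:
    handler = HANDLERS.get(transport)
    if handler is None:
        # No handler for this transport: fall back to always testing.
        return True
    return handler(state)
```

Each new transport would add one entry to the registry, which is the extra
complexity mentioned above.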

> Another option would be to not mechanically test every N seconds, but to
> retest a failed path after 1s - 2s - 4s - ... 32s max as a cascading
> back-off, and maybe start at 2 - 64s for paths in inactive PGs.
> 

A cascading backoff / staggered timer would require less topology
knowledge than the above path health testing method and would provide the
reduced IO load desired (depending on how high a user was willing to go
in setting the delta between path tests).
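
To make the back-off concrete, here is a minimal sketch in Python using the
numbers from the quoted proposal (1s doubling to a 32s max, and 2s to 64s for
paths in inactive PGs). The function names are hypothetical and this is not
how multipathd schedules its checks; it only shows how the number of path
tests in a window shrinks as the ceiling goes up.

```python
from itertools import islice


def backoff_delays(base, ceiling):
    """Yield retest delays in seconds: base, 2*base, 4*base, ... capped at ceiling."""
    delay = base
    while True:
        yield delay
        delay = min(delay * 2, ceiling)


def checks_in_window(window, base, ceiling):
    """Count how many path checks fit into `window` seconds for one path."""
    elapsed = 0
    count = 0
    for delay in backoff_delays(base, ceiling):
        elapsed += delay
        if elapsed > window:
            return count
        count += 1


if __name__ == "__main__":
    # Failed path in the active PG, then a path in an inactive PG.
    print(list(islice(backoff_delays(1, 32), 8)))
    print(list(islice(backoff_delays(2, 64), 7)))
    # Fixed 1s interval (base == ceiling) versus the cascading back-off.
    print(checks_in_window(300, 1, 1), checks_in_window(300, 1, 32))
```

A real tester would also reset the delay to the base value once a path check
succeeds; that part is left out here.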

-andmike
--
Michael Anderson
andmike@us.ibm.com
