From: "Mike Snitzer" <snitzer@gmail.com>
To: Paul Clements <paul.clements@steeleye.com>
Cc: Bill Davidsen <davidsen@tmr.com>, Neil Brown <neilb@suse.de>,
linux-raid@vger.kernel.org, linux-kernel@vger.kernel.org,
nbd-general@lists.sourceforge.net,
Herbert Xu <herbert@gondor.apana.org.au>
Subject: Re: raid1 with nbd member hangs MD on SLES10 and RHEL5
Date: Thu, 14 Jun 2007 21:21:09 -0400 [thread overview]
Message-ID: <170fa0d20706141821t5204ecddl30f846a98b262573@mail.gmail.com> (raw)
In-Reply-To: <4671E85E.30100@steeleye.com>
On 6/14/07, Paul Clements <paul.clements@steeleye.com> wrote:
> Mike Snitzer wrote:
> > On 6/14/07, Paul Clements <paul.clements@steeleye.com> wrote:
> >> Mike Snitzer wrote:
> >>
> >> > Here are the steps to reproduce reliably on SLES10 SP1:
> >> > 1) establish a raid1 mirror (md0) using one local member (sdc1) and
> >> > one remote member (nbd0)
> >> > 2) power off the remote machine, whereby severing nbd0's connection
> >> > 3) perform IO to the filesystem that is on the md0 device to enduce
> >> > the MD layer to mark the nbd device as "faulty"
> >> > 4) cat /proc/mdstat hangs, sysrq trace was collected
> >>
> >> That's working as designed. NBD works over TCP. You're going to have to
> >> wait for TCP to time out before an error occurs. Until then I/O will
> >> hang.
> >
> > With kernel.org 2.6.15.7 (uni-processor) I've not seen NBD hang in the
> > kernel like I am with RHEL5 and SLES10. This hang (tcp timeout) is
> > indefinite oh RHEL5 and ~5min on SLES10.
> >
> > Should/can I be playing with TCP timeout values? Why was this not a
> > concern with kernel.org 2.6.15.7; I was able to "feel" the nbd
> > connection break immediately; no MD superblock update hangs, no
> > longwinded (or indefinite) TCP timeout.
>
> I don't know. I've never seen nbd immediately start returning I/O
> errors. Perhaps something was different about the configuration?
> If the other other machine rebooted quickly, for instance, you'd get a
> connection reset, which would kill the nbd connection.
OK, I'll retest the 2.6.15.7 setup. As for SLES10 and RHEL5, I've
been leaving the remote server powered off. As such I'm at the full
mercy of the TCP timeout. It is odd that RHEL5 has been hanging
indefinitely but I'll dig deeper on that once I come to terms with how
kernel.org and SLES10 behaves.
I'll update with my findings for completeness.
Thanks for your insight!
Mike
next prev parent reply other threads:[~2007-06-15 1:21 UTC|newest]
Thread overview: 15+ messages / expand[flat|nested] mbox.gz Atom feed top
2007-06-13 2:30 raid1 with nbd member hangs MD on SLES10 and RHEL5 Mike Snitzer
2007-06-13 2:42 ` Neil Brown
2007-06-13 2:59 ` Mike Snitzer
[not found] ` <170fa0d20706122009h5e3db54ek7487be4940a3d780@mail.gmail.com>
[not found] ` <18031.25581.353761.802283@notabene.brown>
[not found] ` <170fa0d20706122130q2c77d365tbe9261bab1a5b1b@mail.gmail.com>
2007-06-13 18:23 ` Mike Snitzer
2007-06-13 23:30 ` Mike Snitzer
2007-06-14 21:05 ` Bill Davidsen
2007-06-14 21:57 ` Mike Snitzer
2007-06-15 0:40 ` Paul Clements
2007-06-15 1:01 ` Mike Snitzer
2007-06-15 1:05 ` Paul Clements
2007-06-15 1:10 ` Mike Snitzer
2007-06-15 1:16 ` Paul Clements
2007-06-15 1:21 ` Mike Snitzer [this message]
2007-06-15 13:21 ` Bill Davidsen
2007-06-15 1:00 ` Paul Clements
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=170fa0d20706141821t5204ecddl30f846a98b262573@mail.gmail.com \
--to=snitzer@gmail.com \
--cc=davidsen@tmr.com \
--cc=herbert@gondor.apana.org.au \
--cc=linux-kernel@vger.kernel.org \
--cc=linux-raid@vger.kernel.org \
--cc=nbd-general@lists.sourceforge.net \
--cc=neilb@suse.de \
--cc=paul.clements@steeleye.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).