linux-raid.vger.kernel.org archive mirror
 help / color / mirror / Atom feed
From: "Mike Snitzer" <snitzer@gmail.com>
To: Paul Clements <paul.clements@steeleye.com>
Cc: Bill Davidsen <davidsen@tmr.com>, Neil Brown <neilb@suse.de>,
	linux-raid@vger.kernel.org, linux-kernel@vger.kernel.org,
	nbd-general@lists.sourceforge.net,
	Herbert Xu <herbert@gondor.apana.org.au>
Subject: Re: raid1 with nbd member hangs MD on SLES10 and RHEL5
Date: Thu, 14 Jun 2007 21:01:19 -0400	[thread overview]
Message-ID: <170fa0d20706141801u6d6effd9ub362f3ae397f3d32@mail.gmail.com> (raw)
In-Reply-To: <4671E018.4090105@steeleye.com>

On 6/14/07, Paul Clements <paul.clements@steeleye.com> wrote:
> Bill Davidsen wrote:
>
> > Second, AFAIK nbd hasn't working in a while. I haven't tried it in ages,
> > but was told it wouldn't work with smp and I kind of lost interest. If
> > Neil thinks it should work in 2.6.21 or later I'll test it, since I have
> > a machine which wants a fresh install soon, and is both backed up and
> > available.
>
> Please stop this. nbd is working perfectly fine, AFAIK. I use it every
> day, and so do 100s of our customers. What exactly is it that not's
> working? If there's a problem, please send the bug report.

Paul,

This thread details what I've experienced using MD (raid1) with 2
devices; one being a local scsi device and the other is an NBD device.
 I've yet to put effort to pinpointing the problem in a kernel.org
kernel; however both SLES10 and RHEL5 kernels appear to be hanging in
either 1) nbd or 2) the socket layer.

Here are the steps to reproduce reliably on SLES10 SP1:
1) establish a raid1 mirror (md0) using one local member (sdc1) and
one remote member (nbd0)
2) power off the remote machine, whereby severing nbd0's connection
3) perform IO to the filesystem that is on the md0 device to enduce
the MD layer to mark the nbd device as "faulty"
4) cat /proc/mdstat hangs, sysrq trace was collected

To be clear, the MD superblock update hangs indefinitely on RHEL5.
But with SLES10 it eventually succeeds after ~5min (and MD marks the
nbd0 member faulty); and the other tasks that were blocking waiting
for the MD lock (e.g. 'cat /proc/mdstat') then complete immediately.

If you look back in this thread you'll see traces for md0_raid1 for
both SLES10 and RHEL5.  I hope to try to reproduce this issue on
kernel.org 2.6.16.46 (the basis for SLES10).  If I can I'll then git
bisect back to try to pinpoint the regression; I obviously need to
verify that 2.6.16 works in this situation on SMP.

Mike

  reply	other threads:[~2007-06-15  1:01 UTC|newest]

Thread overview: 15+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2007-06-13  2:30 raid1 with nbd member hangs MD on SLES10 and RHEL5 Mike Snitzer
2007-06-13  2:42 ` Neil Brown
2007-06-13  2:59   ` Mike Snitzer
     [not found]     ` <170fa0d20706122009h5e3db54ek7487be4940a3d780@mail.gmail.com>
     [not found]       ` <18031.25581.353761.802283@notabene.brown>
     [not found]         ` <170fa0d20706122130q2c77d365tbe9261bab1a5b1b@mail.gmail.com>
2007-06-13 18:23           ` Mike Snitzer
2007-06-13 23:30             ` Mike Snitzer
2007-06-14 21:05               ` Bill Davidsen
2007-06-14 21:57                 ` Mike Snitzer
2007-06-15  0:40                 ` Paul Clements
2007-06-15  1:01                   ` Mike Snitzer [this message]
2007-06-15  1:05                     ` Paul Clements
2007-06-15  1:10                       ` Mike Snitzer
2007-06-15  1:16                         ` Paul Clements
2007-06-15  1:21                           ` Mike Snitzer
2007-06-15 13:21                   ` Bill Davidsen
2007-06-15  1:00               ` Paul Clements

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=170fa0d20706141801u6d6effd9ub362f3ae397f3d32@mail.gmail.com \
    --to=snitzer@gmail.com \
    --cc=davidsen@tmr.com \
    --cc=herbert@gondor.apana.org.au \
    --cc=linux-kernel@vger.kernel.org \
    --cc=linux-raid@vger.kernel.org \
    --cc=nbd-general@lists.sourceforge.net \
    --cc=neilb@suse.de \
    --cc=paul.clements@steeleye.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).