All of lore.kernel.org
 help / color / mirror / Atom feed
From: "Pierre Vignéras" <pierre@vigneras.name>
To: MRK <mrk@shiftmail.org>, linux-raid@vger.kernel.org
Subject: Re: mdadm: failed devices become spares!
Date: Fri, 21 May 2010 23:00:35 +0200	[thread overview]
Message-ID: <201005212300.35935.pierre@vigneras.name> (raw)
In-Reply-To: <4BF313CC.9030401@shiftmail.org>

On mercredi 19 mai 2010, MRK wrote:
> On 05/18/2010 04:06 AM, Neil Brown wrote:
> > However if --monitor gets to check the array between the above to events,
> > it will first see that the working drive is now faulty, so it reports a
> > failure, and then see that the faulty device isn't faulty any more and in
> > fact isn't even there.  The "isn't event there" bit doesn't register and
> > it treats it as 'SpareActive'.
> >
> > I should fix that.
> 
> However in one case the two events are not detected in the same round:
> 
> Apr 12 20:10:02 phobos mdadm[3157]: Fail event detected on md device
>  /dev/md2, component device /dev/sdf1
> Apr 12 20:11:02 phobos mdadm[3157]: SpareActive event detected on md device
> /dev/md2, component device /dev/sdf1
> 
> 
> 1 minute passes between the two entries. I suppose that's the mdadm
> daemon polling time.
> 
> In the other case all the entries are at the same time
> 
> Apr 13 08:00:02 phobos mdadm[3157]: Fail event detected on md device
>  /dev/md2, component device /dev/sdd1
> Apr 13 08:00:02 phobos mdadm[3157]: SpareActive event detected on md device
> /dev/md2, component device /dev/sdd1
> Apr 13 08:00:02 phobos last message repeated 7 times
> [...many times that messages..]
> 
> 
> ...plus, in this second case the SpareActive triggers a lot of times
> within that same second (Pierre you cut it short, but are all the "many
> times that messages" all at the exact same time or they span a few
>  seconds?)

Well I was probably  tired when I tried to filter the log for the bug report. 
It seems that this 'last message repeated 7 times' is for the:

Apr 13 08:00:02 phobos kernel: [5814019.208017] nfsd: non-standard errno: 5

not for the:

Apr 13 08:00:02 phobos mdadm[3157]: SpareActive event detected on md device 
/dev/md2, component device /dev/sdd1

I looked into my log and can't find something else. Sorry, sorry, sorry if 
this led you to false conclusions.

> It looks to me like some kind of usb failure where the USB connection or
> USB bridge momentarily fails then immediately gets re-detected and
> re-added to the system. But since there are no usb entries in dmesg,
> that would also be an issue of the usb driver. Could the problem also be
> a mixture with some unwise udev triggers of Debian, maybe somehow
> causing the auto-re-add of the drive to the RAID?
> 
> Pierre:
> - can you post your mdadm.conf?

Sure, but I am not sure it will be useful:

$ cat /etc/mdadm/mdadm.conf
# mdadm.conf
#
# Please refer to mdadm.conf(5) for information about this file.
#

# by default, scan all partitions (/proc/partitions) for MD superblocks.
# alternatively, specify devices to scan, using wildcards if desired.
DEVICE partitions

# auto-create devices with Debian standard permissions
CREATE owner=root group=disk mode=0660 auto=yes

# automatically tag new arrays as belonging to the local system
HOMEHOST <system>

# instruct the monitoring daemon where to send mail alerts
MAILADDR root

# definitions of existing MD arrays
ARRAY /dev/md0 level=raid1 num-devices=2 
UUID=13f4fdef:db0bd815:77e02d4f:1bda00b4
ARRAY /dev/md1 level=raid1 num-devices=2 
UUID=4a120782:2ed3053c:e99784b3:b8e5f7bf
ARRAY /dev/md4 level=raid1 num-devices=2 
UUID=b3c7212a:e95c5081:24bf28c1:396de87f
ARRAY /dev/md2 level=raid10 num-devices=4 
UUID=b34f4192:f823df58:24bf28c1:396de87f
ARRAY /dev/md3 level=raid5 num-devices=3 
UUID=e1f30f82:0999431b:24bf28c1:396de87f



> - USB is not good for RAID imho. Many times in my life I saw problems
> with USB/SATA bridges where the drive would get disconnected on high I/O
> activity and then reconnected after a few seconds. Anyway, readding it
> to the RAID shouldn't have happened. Also in my case there were "usb"
> entries in dmesg.

Well, that is what I discover: USB and RAID is not currently fine (hum, on 
Debian stable, not sure, we can say 'currently', kernel is:

$ uname -a
Linux phobos 2.6.26-2-686 #1 SMP Tue Mar 9 17:35:51 UTC 2010 i686 GNU/Linux
$

).

Anyway, it would be a great feature if USB can be used for a RAID setup, at 
least for end users (actually, I am using in my setup, a "special" layout for 
the using of RAID on several heterogeneous drives that I described here:

http://www.linuxconfig.org/prouhd-raid-for-the-end-user

)

Thanks for your help and regards.
-- 
Pierre Vignéras
--
To unsubscribe from this list: send the line "unsubscribe linux-raid" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html

  parent reply	other threads:[~2010-05-21 21:00 UTC|newest]

Thread overview: 12+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2010-05-16 15:40 mdadm: failed devices become spares! Pierre Vignéras
2010-05-16 19:56 ` Leslie Rhorer
2010-05-17 18:10   ` Pierre Vignéras
2010-05-17 21:09     ` Tim Small
2010-05-18  1:30     ` Neil Brown
2010-05-18  2:06       ` Neil Brown
2010-05-18 22:25         ` MRK
2010-05-19 19:56           ` Simon Matthews
2010-05-21 21:00           ` Pierre Vignéras [this message]
2010-05-21 21:27         ` mdadm: failed devices become spares! -> Solved ! Pierre Vignéras
2010-05-18 23:07       ` mdadm: failed devices become spares! Pierre Vignéras
2010-05-19  1:45         ` Neil Brown

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=201005212300.35935.pierre@vigneras.name \
    --to=pierre@vigneras.name \
    --cc=linux-raid@vger.kernel.org \
    --cc=mrk@shiftmail.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.