Linux RAID subsystem development
 help / color / mirror / Atom feed
From: Phil Turmel <philip@turmel.org>
To: Nic Wolfe <nic@wolfeden.ca>
Cc: linux-raid@vger.kernel.org
Subject: Re: RAID6 growing interrupted, array won't assemble or resume growing
Date: Fri, 07 Jun 2013 08:43:54 -0400	[thread overview]
Message-ID: <51B1D58A.3010309@turmel.org> (raw)
In-Reply-To: <CAGjXdAj+xsW=Ut9EUz65Ha_ZaGwgzJwRG6vh+9S60jpCK1qNpA@mail.gmail.com>

On 06/07/2013 12:15 AM, Nic Wolfe wrote:
> My original post had an error - sda is my boot drive, it's not part of
> the array. The 6th drive is as follows:
> 
> /dev/sdg:

[trim /]

>       Number   Major   Minor   RaidDevice State
> this     5      65        0        5      active sync

Before you move drives around, please spend some time correlating drive
serial numbers to the "RaidDevice" number for each.  Just in case.  You
may be able to get everything you need from /dev/disk/by-id/, but with a
weird driver, I not sure.

>> Your reshape is barely started.  Presumably you specified a --backup
>> clause in the original --grow command.  You will need that file.
> 
> Embarrassingly I have no backup file from the grow operation.

mdadm should have halted if it knew it was needed.  You may be ok.

>> Modern mdadm should be able to force assemble this and continue without
>> problems.  Rather than operate within a questionable environment, I
>> would strongly encourage you to perform the forced assembly with a
>> recent live cd.  I personally use "SystemRescueCD", and I know it has
>> the appropriate kernel support and tools.
>>
>> But.  You need to share more information about your hardware problems.
>> Dmesg, etc.  There are commonly-encountered configuration problems that
>> appear to be mysterious drive failures.  If you know all about error
>> recovery control, please elaborate.  Otherwise, please share the output
>> of "smartctl -x /dev/sdX" for all of your member devices.
> 
> My drives are connected to the machine through a poorly supported old
> RAID card (rr2522) which required me to build the driver into my
> kernel, so I don't think a live cd will work. I don't have enough SATA
> slots in the machine to connect them all without it.

Hmmm.  That's ugly.

> If it isn't obvious by now, I definitely don't know anything about
> error recovery control.

Well, search the archive for terms like "scterc", "ure", and/or
"timeout".  You'll need to address the issue if you don't have
enterprise drives.

> Since it's a RAID card and not just a hba each drive is presented to
> the OS as a single drive JBOD array which means the OS doesn't see any
> SMART info. I can see (at least some) SMART information through my
> RAID card admin console and it claims they're all fine.

Drives with pending unrecoverable read errors won't make it through a
rebuild, but show "PASSED" to smart.  The details matter.

> I am not having hardware problems at the moment, I only encountered
> them when I had 16 drives running through the RAID card (I have
> another 10 drive array). With the other array disconnected the card
> seems to be behaving - there's nothing suspicious that I can see in
> dmesg.

That sounds like insufficient power.

> In the meantime I will see if I can put together a machine with 6 SATA
> ports and attempt to hook the drives up directly rather than through
> the RAID card so I can use a live CD and get the SMART information for
> you.

Good plan, but do get the serial numbers vs. array role info.  Probably
should finish the reshape in such a machine, too.

Phil

  reply	other threads:[~2013-06-07 12:43 UTC|newest]

Thread overview: 8+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2013-06-06  6:41 RAID6 growing interrupted, array won't assemble or resume growing Nic Wolfe
2013-06-06 17:31 ` Phil Turmel
2013-06-07  4:15   ` Nic Wolfe
2013-06-07 12:43     ` Phil Turmel [this message]
2013-06-19  6:21       ` Nic Wolfe
2013-06-19 18:36         ` Phil Turmel
2013-06-19 23:52           ` Nic Wolfe
2013-06-21  4:17           ` Nic Wolfe

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=51B1D58A.3010309@turmel.org \
    --to=philip@turmel.org \
    --cc=linux-raid@vger.kernel.org \
    --cc=nic@wolfeden.ca \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox