From: Francis Moreau <francis.moro@gmail.com>
To: Martin Wilck <mwilck@arcor.de>
Cc: linux-raid@vger.kernel.org
Subject: Re: MD array keeps resyncing after rebooting
Date: Sat, 3 Aug 2013 13:02:37 +0200 [thread overview]
Message-ID: <CAC9WiBj1STmwgUswei--dr1YSbnpPgc9M8thvGuAK1wziKa==Q@mail.gmail.com> (raw)
In-Reply-To: <51FBF830.5060606@arcor.de>
Hello Martin,
First of all, thanks a lot for your detailed analysis.
On Fri, Aug 2, 2013 at 8:19 PM, Martin Wilck <mwilck@arcor.de> wrote:
> On 08/02/2013 02:46 PM, Francis Moreau wrote:
>
>> Please tell me if you can find something suspicous.
>
> One thing that I can see is that your BIOS seems to use the same time
> stamp everywhere. It is clear from the dump that the BIOS has changed
> the timestamp in the VD GUID, too. The timestamp used in the
> "before-bios" data is 2013-08-02 08:21:10, the timestamp after is
> 2013-08-02 14:27:27. Wonder if that fits?
Probably. I booted the system at 8:21, let the resync finish and did
the dump after. So it seems to fit.
>
> The spec says that the VD GUID consists of the vendor ID ("LSI " in
> your case), controller type (the controller's PCI ID,
> 8086:1d60:0000:0000), the 4-byte time stamp, and a "random number"
> (0000 1450). Strangely, I also have an LSI fake RAID and it uses the
> same random number. It even generated the same number through several
> RAID creations. Seems to be a truly strange random number generator :-)
>
> All in all, this makes your controller's RAID GUIDs very predictable.
> But they change whenever the timestamp changes. It also explains why
> this controller can't have more than a single array.
>
> But that's not what you wanted to know, right?
Well that was very instructive, thanks.
> Besides the time stamp,
> sequence number, and CRC32, there is actually no difference between the
> two dumps. The suspicious part is here, same in both dumps:
>
> 00000860 00 00 ff ff ff ff ff ff ff ff ff ff ff ff ff ff
>
> The first two bytes in thus line are the state and init state, and the
> meaning is "Optimal, consistent, *not initialized*".
> And this is before *and* after BIOS started. Suspiciously, this doesn't
> match what you wrote before:
>
>> I checked during the shudown process that the array is correctly
>> > stopped since at that point I got:
>> >
>> > # mdadm -E /dev/sda | egrep "state"
>> > state[0] : Optimal, Consistent
>> > init state[0] : Fully Initialised
>
> This would correspond to "00 02", and it's what we should see after
> initialization. On my system the BIOS sets "00 01" (Optimal, consistent,
> Quick Init in progress) when it first creates an array, because the BIOS
> doesn't do a full initialization. But "not initialized" is weird. The
> mdadm DDF code won't set this by itself, AFAIK. Please make sure again
> that the "before" data matches what mdadm/mdmon wrote just after
> stopping during shutdown.
I'm pretty sure that's the "before" dump was just after stopping the
array, this is how I proceed: during the shutdown, I stop the system
right after "mdadm -S", then I checked the state of the array with
"mdadm -E", which was initialized for sure. Then I powered off the
machine, removed one disk and did the "before" dump from my laptop.
After that I booted the machine with the disk in place until grub menu
appeared. I then powered off the machine and did the same procedure as
previously to get "after" dump.
Perhaps some data were still in a "cache" and was not written to the
disk when the power-off/reboot did happen ?
Maybe one thing that worths to note is that the same disk array
configuration with the same system works fine on qemu.
--
Francis
next prev parent reply other threads:[~2013-08-03 11:02 UTC|newest]
Thread overview: 26+ messages / expand[flat|nested] mbox.gz Atom feed top
2013-07-23 9:46 MD array keeps resyncing after rebooting Francis Moreau
2013-07-23 12:55 ` Francis Moreau
2013-07-23 18:52 ` Martin Wilck
2013-07-23 18:52 ` Martin Wilck
2013-07-23 20:01 ` Francis Moreau
2013-07-23 21:21 ` Martin Wilck
2013-07-24 4:32 ` Francis Moreau
2013-07-24 13:50 ` Francis Moreau
2013-07-25 18:58 ` Martin Wilck
2013-07-25 19:06 ` Martin Wilck
2013-07-25 20:23 ` Francis Moreau
2013-07-29 15:46 ` Francis Moreau
2013-07-25 20:21 ` Francis Moreau
2013-07-31 19:36 ` Francis Moreau
2013-08-01 13:28 ` Francis Moreau
2013-08-01 18:15 ` Martin Wilck
2013-08-02 12:49 ` Francis Moreau
[not found] ` <CAC9WiBhGyE=OJdSeL_OsPxtirhJ2=3WRsk=uBPiOTzMjBCf-dA@mail.gmail.com>
2013-08-02 18:19 ` Martin Wilck
2013-08-03 0:08 ` Sam Bingner
2013-08-03 11:02 ` Francis Moreau [this message]
2013-08-08 7:18 ` Francis Moreau
2013-08-24 18:54 ` Martin Wilck
2013-08-26 7:44 ` Francis Moreau
2013-08-31 19:23 ` Francis Moreau
2013-09-01 17:05 ` Martin Wilck
2013-08-05 6:08 ` NeilBrown
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to='CAC9WiBj1STmwgUswei--dr1YSbnpPgc9M8thvGuAK1wziKa==Q@mail.gmail.com' \
--to=francis.moro@gmail.com \
--cc=linux-raid@vger.kernel.org \
--cc=mwilck@arcor.de \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).