linux-raid.vger.kernel.org archive mirror
* RAID needs more to survive a power hit, different /boot layout for example (was Re: draft howto on making raids for surviving a disk crash)
@ 2008-02-03 19:15 Moshe Yudkowsky
From: Moshe Yudkowsky @ 2008-02-03 19:15 UTC
  To: linux-raid

I've been reading the draft and checking it against my experience.
Because of local power fluctuations, I've just tested my system by
accident: it does *not* survive a power hit. This has happened twice
already today.

I've got /boot and a few other pieces in a 4-disk RAID 1 (three running, 
one spare). This partition is on /dev/sd[abcd]1.

I've used grub to install grub on all three running disks:

grub --no-floppy <<EOF
root (hd0,1)
setup (hd0)
root (hd1,1)
setup (hd1)
root (hd2,1)
setup (hd2)
EOF

(To those reading this thread to find out how to recover: According to 
grub's "map" option, /dev/sda1 maps to hd0,1.)
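
As a rough sketch only, the same batch of commands can be generated for
any number of disks instead of being typed by hand. The function name
gen_grub_batch and its two parameters (disk count and grub partition
index) are hypothetical and not part of the original setup; the
generated lines are the same "root" and "setup" commands used above.

```shell
#!/bin/sh
# Print a grub batch that installs the boot loader on disks
# hd0 .. hd(N-1), using partition index PART as the root on each disk.
# gen_grub_batch, N and PART are illustrative names (assumptions).
gen_grub_batch() {
    ndisks=$1
    part=$2
    i=0
    while [ "$i" -lt "$ndisks" ]; do
        printf 'root (hd%d,%d)\n' "$i" "$part"
        printf 'setup (hd%d)\n' "$i"
        i=$((i + 1))
    done
}

# Print the commands for three disks so they can be reviewed first.
gen_grub_batch 3 1

# To apply them (as root), pipe the output into the grub shell:
#   gen_grub_batch 3 1 | grub --no-floppy
```

Printing the commands before piping them into grub makes it easy to
confirm that the disk and partition numbers match the machine.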


After the power hit, I get:

 > Error 16
 > Inconsistent filesystem mounted

I then tried to boot from hd1,1 and hd2,1 -- neither worked.

The culprit, in my opinion, is the reiserfs file system. During the 
power hit, the reiserfs file system of /boot was left in an inconsistent 
state; this meant I had up to three bad copies of /boot.

Recommendations:

1. I'm going to try adding a data=journal option to the reiserfs file 
systems, including the /boot. If this does not work, then /boot must be 
ext3 in order to survive a power hit.

2. We discussed what should be on the RAID1 bootable portion of the 
filesystem. True, it's nice to have the ability to boot from just the 
RAID1 portion. But if that RAID1 portion can't survive a power hit,
there's little point. It might make a lot more sense to put /boot on its
own tiny partition.

The Fix:

The way to fix this boot problem is to get the reiserfs file system
back into a consistent state. I did this by booting to my emergency
single-disk partition ((hd0,0) if you must know) and then mounting the
/dev/md/root that contains /boot. This forced a reiserfs consistency
check and journal replay, and let me reboot without problems.
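
A minimal sketch of that recovery step, to be run from the emergency
system: mount the array once so the journal is replayed, then unmount
it again. The device /dev/md/root is the one named above; the function
name replay_journal, the mount point /mnt and the DRY_RUN switch are
assumptions added for illustration.

```shell
#!/bin/sh
# Mount and then unmount a reiserfs file system so that the kernel
# replays its journal. With DRY_RUN set to a non-empty value the
# commands are only printed, not executed.
# replay_journal, the mount point and DRY_RUN are hypothetical.
replay_journal() {
    device=$1
    mountpoint=$2
    if [ -n "${DRY_RUN:-}" ]; then
        run=echo
    else
        run=
    fi
    # Mounting is what triggers the journal replay.
    $run mount -t reiserfs "$device" "$mountpoint" || return 1
    $run umount "$mountpoint"
}

# Show what would be run; unset DRY_RUN (and run as root) to do it.
DRY_RUN=1
replay_journal /dev/md/root /mnt
```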



-- 
Moshe Yudkowsky * moshe@pobox.com * www.pobox.com/~moshe
"A gun is, in many people's minds, like a magic wand. If you point it at
people, they are supposed to do your bidding."
    				-- Edwin E. Moise, _Tonkin Gulf_


