linux-raid.vger.kernel.org archive mirror
 help / color / mirror / Atom feed
From: maarten <maarten@ultratux.net>
To: linux-raid@vger.kernel.org
Subject: Re: ext3 journal on software raid (was Re: PROBLEM: Kernel 2.6.10 crashing repeatedly and hard)
Date: Mon, 3 Jan 2005 19:30:36 +0100	[thread overview]
Message-ID: <200501031930.36356.maarten@ultratux.net> (raw)
In-Reply-To: <a41oa2-fr7.ln1@news.it.uc3m.es>

On Monday 03 January 2005 15:23, Peter T. Breuer wrote:
> Michael Tokarev <mjt@tls.msk.ru> wrote:
> > Peter T. Breuer wrote:

> Therefore, I have learned not to build a system that is more complicated
> than the most simple human being that may administer it. This always
> works - if it breaks AND they cannot fix it, then THEY get the blame.
>
> So I "prefer" to not have a raided boot partition, but instead to rsync
> the root partition every day to a spare on a differet disk, or/and at the
> other end of the same disk. This also saves the system from sysadmin
> gaffes - I don't WANT an instantaneous copy of every error made by the
> humans.

There certainly is something to be said for that...  
However, I do expect an admin to know about the raid system that's being used, 
else they would have no business being near that server in the first place. 

> > disks are really 35Gb or 37Gb; in case they're differ, "extra" space
> > on large disk isn't used); root and /boot are on small raid1 partition
> > which is mirrored on *every* disk; swap is on raid1; the rest (/usr,
>
> I like this - except of course that I rsync them, not raid them. I
> don't mind if I have to reboot a server. Nobody will notice the tcp
> outage and the other one of the pair will failover for it, albeit in
> readonly mode, for the maximum of the few minutes required.

I tend to agree, but it varies widely with the circumstances.  I've had 
servers in unattended colo facilties, and your approach will not work too 
well there.

> That's actually not so. Over new year I accidently booted my home
> server (222 days uptime!) and discovered its boot sector had evaporated.

We've all been there...  :-(

> Well, maybe I moved the kernels ..  anyway, it has no floppy and the
> nearest boot cd was an hour's journey away in the cold, on new year.  Uh
> uh.  It took me about 8 hrs, but I booted it via PXE DHCP TFTP
> wake-on-lan and the wireless network, from my laptop, without leaving
> the warm.

Congrats, but I do hope you did that for your home server...!  Cause I'd have 
severe moral and practical difficulties selling that to a paying customer:  
"So instead of billing me a cab fare and two hours, you spent eight hours to 
fix this.  And you seriously expect me to pay for those extra hours ?" 

> > to allocate that space on every of 2 or 3 or 4 or 5 disks).  So
> > it isn't quite relevant how fast the filesystem will be on writes,
> > and hence it's ok to place it on raid1 composed from 5 components.
>
> That is, uh, paranoid.

We also did use three-way raid-1 mirrors as a rule.
(but I am indeed somewhat paranoid ;-)


> > In case of some problem
> > (yes I dislike any additional layers for critical system components
> > as any layer may fail to start during boot etc), you can easily
> > bring the system up by booting off the underlying root-raid partiton
> > to repair the system -- all the utilities are here.  More, you can
>
> Well, you could, and I could, but I doubt if the standard tech could.

I've said it before and I'll say it again:  An admin has to be competent. If 
not, there is little you can do.  You can't have fresh MCSE people fix linux 
problems, and you cannot have a carpenter selling stock on wall street.

A "standard tech" as you say, has a skill level that enables him to swap a 
drive of a hotswap server if so directed, but anything beyond that is 
unrealistic, and he will need adequate help (be it remote by telephone, or 
whatever means).  Or very extensive onsite step by step documentation.

> But why bother? If you didn't have raid there on root you wouldn't
> need to repair it. Nothing is quite as horrible as having a
> fubarred root partition.  That's why I also always have two! But I
> don't see that having the copy made by raid rather than rsync wins
> you anything in the situaton where you have to  reboot - rather, it
> puts off that moment to a moment of your choosing, which may be good,
> but is not an unqualified bonus, given the cons.

Both approaches have their merits.  In one case the danger lies in not having 
updated the rsync mirror recently enough, in the other a rogue change will 
affect all your mirrors.  Without further info on the specific circumstances 
no choice can be made, it really depends on too many factors. 

> > And yes I'm aware of mdp devices (partitions inside the raid
> > arrays).. but that's just another layer "which may fail": if
> > raid5 array won't start, I at least can reconstruct filesystem
> > image by reading chunks of data from appropriate places from
> > all drives and try to recover that image; with any additional
>
> Now that is just perverse.

Not neccessarily.  I've had to rely on using dd_rescue to get data back at 
some point is time. In such scenarios, any additional layer can quickly 
complicate things beyond reasonable recourse.
As you noted yourself, keeping a backup stategy can be hard work. ;-|

> > Note above about swap: in all my
> > systems, swap is also on raid (raid1 in this case).  At the first
> > look, that can be a nonsense: having swap on raid.  But we had
> > enouth cases when due to a failed drive swap becomes corrupt
> > (unreadable really), and the system goes havoc, *damaging*
> > other data which was unaffected by the disk failure!  With
>
> Yes, this used to be quite common when swap had that size bug.

When you have swap on a failed disk, often the safer way is to stop the 
machine by using the reset button instead of attempting a shutdown.
The shutdown would probably fail halfway through anyway...

Maarten


  reply	other threads:[~2005-01-03 18:30 UTC|newest]

Thread overview: 172+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2004-12-30  0:31 PROBLEM: Kernel 2.6.10 crashing repeatedly and hard Georg C. F. Greve
2004-12-30 16:23 ` Georg C. F. Greve
2004-12-30 17:39   ` Peter T. Breuer
2004-12-30 17:53     ` Sandro Dentella
2004-12-30 18:31       ` Peter T. Breuer
2004-12-30 19:50     ` Michael Tokarev
2004-12-30 21:39       ` Peter T. Breuer
2005-01-02 19:42         ` ext3 journal on software raid (was Re: PROBLEM: Kernel 2.6.10 crashing repeatedly and hard) Andy Smith
2005-01-02 20:18           ` Peter T. Breuer
2005-01-03  0:30             ` Andy Smith
2005-01-03  6:41               ` Neil Brown
2005-01-03  8:37                 ` Peter T. Breuer
2005-01-03  8:03               ` Peter T. Breuer
2005-01-03  8:58                 ` Guy
2005-01-03 10:18                 ` Partiy error detection - was " Brad Campbell
2005-01-03 12:11                 ` Michael Tokarev
2005-01-03 14:23                   ` Peter T. Breuer
2005-01-03 18:30                     ` maarten [this message]
2005-01-03 21:36                     ` Michael Tokarev
2005-01-05  5:50                     ` Debian Sarge mdadm raid 10 assembling at boot problem Roger Ellison
2005-01-05 13:41                       ` Michael Tokarev
2005-01-05 13:57                         ` [help] [I2O] Adaptec 2400A on FC3 Angelo Piraino
2005-01-05 19:15                         ` Debian Sarge mdadm raid 10 assembling at boot problem Roger Ellison
2005-01-05  9:56           ` ext3 journal on software raid (was Re: PROBLEM: Kernel 2.6.10 crashing repeatedly and hard) Andy Smith
2005-01-05 10:44             ` Alvin Oga
2005-01-05 10:56               ` Brad Campbell
2005-01-05 11:39                 ` Alvin Oga
2005-01-05 12:02                   ` Brad Campbell
2005-01-05 13:23                     ` Alvin Oga
2005-01-05 13:33                       ` Brad Campbell
2005-01-05 14:44                         ` parts -- " Alvin Oga
2005-01-19  4:46                           ` Clemens Schwaighofer
2005-01-19  5:05                             ` Alvin Oga
2005-01-19  5:49                               ` Clemens Schwaighofer
2005-01-19  7:08                                 ` Alvin Oga
2005-01-05 13:36                       ` Swap should be mirrored or not? (was Re: ext3 journal on software raid) Andy Smith
2005-01-05 14:12                 ` ext3 journal on software raid (was Re: PROBLEM: Kernel 2.6.10 crashing repeatedly and hard) Erik Mouw
2005-01-05 14:37                   ` Michael Tokarev
2005-01-05 14:55                     ` errors " Alvin Oga
2005-01-05 17:11                     ` Erik Mouw
2005-01-06  5:41                       ` Brad Campbell
2005-01-05 15:17                 ` Guy
2005-01-05 15:33                   ` Alvin Oga
2005-01-05 16:22                     ` Michael Tokarev
2005-01-05 17:23                       ` Peter T. Breuer
2005-01-05 16:23                     ` Andy Smith
2005-01-05 16:30                       ` Andy Smith
2005-01-05 17:04                       ` swp - " Alvin Oga
2005-01-05 17:26                         ` Andy Smith
2005-01-05 18:32                           ` Alvin Oga
2005-01-05 22:35                             ` Andy Smith
2005-01-06  0:57                               ` Guy
2005-01-06  1:28                                 ` Mike Hardy
2005-01-06  3:32                                   ` Guy
2005-01-06  4:49                                     ` Mike Hardy
2005-01-09 21:07                                       ` Mark Hahn
2005-01-06  5:04                                   ` Alvin Oga
2005-01-06  6:18                                     ` Guy
2005-01-06  6:31                                       ` Alvin Oga
2005-01-06  9:38                                     ` swap on RAID (was Re: swp - Re: ext3 journal on software raid) Andy Smith
2005-01-06 17:46                                       ` Mike Hardy
2005-01-06 22:08                                         ` No swap can be dangerous (was Re: swap on RAID (was Re: swp - Re: ext3 journal on software raid)) Andrew Walrond
2005-01-06 22:34                                           ` Jesper Juhl
2005-01-06 22:57                                             ` Mike Hardy
2005-01-06 23:15                                               ` Guy
2005-01-07  9:28                                                 ` Andrew Walrond
2005-02-28 20:07                                                   ` Guy
2005-01-07  1:31                                       ` confused Re: swap on RAID (was Re: swp - Re: ext3 journal on software raid) Alvin Oga
2005-01-07  2:28                                         ` Andy Smith
2005-01-07 13:04                                           ` Alvin Oga
2005-01-09 21:21                                     ` swp - Re: ext3 journal on software raid (was Re: PROBLEM: Kernel 2.6.10 crashing repeatedly and hard) Mark Hahn
2005-01-09 22:20                                       ` Alvin Oga
2005-01-06  5:01                                 ` Alvin Oga
2005-01-05 17:07                     ` Guy
2005-01-05 17:21                       ` Alvin Oga
2005-01-05 17:32                         ` Guy
2005-01-05 18:37                           ` Alvin Oga
2005-01-05 17:34                         ` ECC: RE: ext3 blah blah blah Gordon Henderson
2005-01-05 18:33                           ` Alvin Oga
2005-01-05 17:26                       ` ext3 journal on software raid (was Re: PROBLEM: Kernel 2.6.10 crashing repeatedly and hard) David Greaves
2005-01-05 18:16                         ` Peter T. Breuer
2005-01-05 18:28                           ` Guy
2005-01-05 18:26                         ` Guy
2005-01-05 15:48                   ` Peter T. Breuer
     [not found]       ` <41D45C1F.5030307-XAri/EZa3C4vJsYlp49lxw@public.gmane.org>
2004-12-30 20:54         ` PROBLEM: Kernel 2.6.10 crashing repeatedly and hard berk walker
2005-01-01 13:39         ` Helge Hafting
2005-01-07  6:21     ` Clemens Schwaighofer
2005-01-07  9:39       ` Andy Smith
  -- strict thread matches above, loose matches on Subject: below --
2005-01-03  9:30 ext3 journal on software raid (was Re: PROBLEM: Kernel 2.6.10 crashing repeatedly and hard) Peter T. Breuer
     [not found] <200501030916.j039Gqe23568@inv.it.uc3m.es>
2005-01-03 10:17 ` Guy
2005-01-03 11:31   ` Peter T. Breuer
2005-01-03 17:34     ` Guy
2005-01-03 17:46     ` maarten
2005-01-03 19:52       ` maarten
2005-01-03 20:41         ` Peter T. Breuer
2005-01-03 23:19           ` Peter T. Breuer
2005-01-03 23:46             ` Neil Brown
2005-01-04  0:28               ` Peter T. Breuer
2005-01-04  1:18                 ` Alvin Oga
2005-01-04  4:29                   ` Neil Brown
2005-01-04  8:43                     ` Peter T. Breuer
2005-01-04  2:07                 ` Neil Brown
2005-01-04  2:16                   ` Ewan Grantham
2005-01-04  2:22                     ` Neil Brown
2005-01-04  2:41                       ` Andy Smith
2005-01-04  3:42                         ` Neil Brown
2005-01-04  9:50                           ` Peter T. Breuer
2005-01-04 14:15                             ` David Greaves
2005-01-04 15:20                               ` Peter T. Breuer
2005-01-04 16:42                             ` Guy
2005-01-04 17:46                               ` Peter T. Breuer
2005-01-04  9:30                         ` Maarten
2005-01-04 10:18                           ` Peter T. Breuer
2005-01-04 13:36                             ` Maarten
2005-01-04 14:13                               ` Peter T. Breuer
2005-01-04 19:22                                 ` maarten
2005-01-04 20:05                                   ` Peter T. Breuer
2005-01-04 21:38                                     ` Guy
2005-01-04 23:53                                       ` Peter T. Breuer
2005-01-05  0:58                                       ` Mikael Abrahamsson
2005-01-04 21:48                                     ` maarten
2005-01-04 23:14                                       ` Peter T. Breuer
2005-01-05  1:53                                         ` maarten
2005-01-04  9:46                         ` Peter T. Breuer
2005-01-04 19:02                           ` maarten
2005-01-04 19:12                             ` David Greaves
2005-01-04 21:08                             ` Peter T. Breuer
2005-01-04 22:02                               ` Brad Campbell
2005-01-04 23:20                                 ` Peter T. Breuer
2005-01-05  5:44                                   ` Brad Campbell
2005-01-05  9:00                                     ` Peter T. Breuer
2005-01-05  9:14                                       ` Brad Campbell
2005-01-05  9:28                                         ` Peter T. Breuer
2005-01-05  9:43                                           ` Brad Campbell
2005-01-05 15:09                                             ` Guy
2005-01-05 15:52                                               ` maarten
2005-01-05 10:04                                           ` Andy Smith
2005-01-04 22:21                               ` Neil Brown
2005-01-05  0:08                                 ` Peter T. Breuer
2005-01-04 22:29                               ` Neil Brown
2005-01-05  0:19                                 ` Peter T. Breuer
2005-01-05  1:19                                   ` Jure Pe_ar
2005-01-05  2:29                                     ` Peter T. Breuer
2005-01-05  0:38                               ` maarten
2005-01-04  9:40                   ` Peter T. Breuer
2005-01-04 14:03                     ` David Greaves
2005-01-04 14:07                       ` Peter T. Breuer
2005-01-04 14:43                         ` David Greaves
2005-01-04 15:12                           ` Peter T. Breuer
2005-01-04 16:54                             ` David Greaves
2005-01-04 17:42                               ` Peter T. Breuer
2005-01-04 19:12                                 ` David Greaves
2005-01-04  0:45           ` maarten
2005-01-04 10:14             ` Peter T. Breuer
2005-01-04 13:24               ` Maarten
2005-01-04 14:05                 ` Peter T. Breuer
2005-01-04 15:31                   ` Maarten
2005-01-04 16:21                     ` Peter T. Breuer
2005-01-04 20:55                       ` maarten
2005-01-04 21:11                         ` Peter T. Breuer
2005-01-04 21:38                         ` Peter T. Breuer
2005-01-04 23:29                           ` Guy
2005-01-04 19:57                     ` Mikael Abrahamsson
2005-01-04 21:05                       ` maarten
2005-01-04 21:26                         ` Alvin Oga
2005-01-04 21:46                         ` Guy
2005-01-03 20:22       ` Peter T. Breuer
2005-01-03 23:05         ` Guy
2005-01-04  0:08         ` maarten
2005-01-03 21:36       ` Guy
2005-01-04  0:15         ` maarten
2005-01-04 11:21           ` Michael Tokarev

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=200501031930.36356.maarten@ultratux.net \
    --to=maarten@ultratux.net \
    --cc=linux-raid@vger.kernel.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).