linux-raid.vger.kernel.org archive mirror
From: bernd@rhm.de
To: linux-raid@vger.kernel.org
Subject: Re: Linux 2.6.10 / RAID1 problem
Date: Wed, 5 Jan 2005 23:46:56 +0100 (MEZ)
Message-ID: <200501052246.XAA07775@node130.rhm.de>
In-Reply-To: <41DC515C.7030208@anduras.de> from Sven Anders at "Jan 5, 5 09:43:08 pm"

Hi Sven,

we used RAID1 under kernel 2.4.x without any problems (except that the
counters for active and spare disks became wrong). With kernel 2.6.x RAID1
is a disaster. This problem was discussed in this newsgroup in the second
half of last year. The kernel dies if a disk fails or if the system is
heavily loaded (the latter we can't reproduce, but it happens; maybe
other issues on the SCSI bus, with different causes, lead to the same
result).

If one disk fails or if there is a bus reset on one of the two SCSI
controllers we are using for RAID1, the system crashes. It doesn't carry
on with the remaining disk on the other controller, which is what one
expects from RAID1 (and is the whole reason for using RAID1!). In the
logs we see many failed superblock writes and other strange things
just before the kernel gives up and dies. From another company we
heard about data corruption when this happens. We didn't see
this, maybe because we are using Reiserfs and they are not.

I asked about the state of this problem some days ago in a new thread
in this newsgroup, but there was no response. I am surprised that nobody
out there seems to have similar problems.

Another hint: you can't reproduce this by using mdadm -f to set a disk
faulty. That works fine because it enters the kernel by a different
path than a 'real' disk failure does.
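
For reference, a minimal sketch of the software-only failure test
described above. The device names /dev/md0 and /dev/sdb1 and the
DRY_RUN switch are assumptions for illustration, not taken from our
setup. By default the script only prints the commands; as noted, this
path does not trigger the crash we see with a real disk failure or bus
reset.

```shell
#!/bin/sh
# Sketch: simulate a member failure with mdadm (software path only).
# MD_DEV and MEMBER are hypothetical names; adjust them to the real array.
MD_DEV=${MD_DEV:-/dev/md0}
MEMBER=${MEMBER:-/dev/sdb1}
# DRY_RUN=1 (default) only prints the commands; set DRY_RUN=0 as root to run them.
DRY_RUN=${DRY_RUN:-1}

run() {
    if [ "$DRY_RUN" = 1 ]; then
        echo "would run: $*"
    else
        "$@"
    fi
}

soft_fail_cycle() {
    run mdadm "$MD_DEV" -f "$MEMBER"   # mark the member faulty
    run cat /proc/mdstat               # inspect the array state
    run mdadm "$MD_DEV" -r "$MEMBER"   # remove the faulty member
    run mdadm "$MD_DEV" -a "$MEMBER"   # add it back to the array
}

soft_fail_cycle
```

Run with DRY_RUN=0 only on a test array, since it really degrades the
mirror until the member has been added back and rebuilt.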

So in summary, my answer to your question about the stability of RAID1
under 2.6.x is no, even if our problems do not correspond exactly
to yours!

Greetings Bernd Rieke
R&H Computer Systems
Tel +49 (0)89 750078

-------------------------------------------------------------------------

>Hello!
>
>I'm experiencing strange problems on my server running Software RAID1 (Mirroring) under
>the Linux kernel 2.6.10. It works nicely under Linux 2.4.18!
>
>Configuration:
>~ Athlon 1.5GHz, 256MB RAM (tested!)
>~ 2x160GB Harddisks (same type) on different IDE controllers
>~  (ext3 fs with journalling turned on)
>~ Swap is turned off
>~ Linux Kernel 2.6.10 (vanilla) without SMP, Preemption turned off
>
>Test case:
>~ dd if=/dev/zero of=test0 bs=1M count=300
>~ while :; do cp test0 test1; cp test1 test2; cp test2 test0; od test0; done
>
>Error:
>~ On Linux 2.4.18 it worked perfectly for several hours (until I stopped it...)
>~ On Linux 2.6.10 after some minutes the following (or similar) error occurs:
>
>EXT3-fs error (device md5): ext3_free_blocks_sb: bit already cleared for block 1303980
>Aborting journal on device md5.
>ext3_abort called.
>EXT3-fs error (device md5): ext3_journal_start_sb: Detected aborted journal
>Remounting filesystem read-only
>EXT3-fs error (device md5) in start_transaction: Journal has aborted
.... snip ....
.... snip ....
>EXT3-fs error (device md5) in start_transaction: Journal has aborted
>
>After this I have to do a filesystem check and the 2.4.18 kernel reports a dirty RAID and
>starts the resync...
>
>Does anybody have an idea?
>What's the cause of this???
>
>Is there a known bug in 2.6.10?
>Is the Software RAID in the 2.6 series stable?
>
>Any special kernel compile options not to use when using RAID?
>Any other (better) test to do?
>
>Regards
>~ Sven
>
>~ Sven Anders <anders@anduras.de>
>
>~ ANDURAS service solutions AG
>~ Innstraße 71 - 94036 Passau - Germany
>~ Web: www.anduras.de - Tel: +49 (0)851-4 90 50-0 - Fax: +49 (0)851-4 90 50-55
>
>Rechtsform: Aktiengesellschaft - Sitz: Passau - Amtsgericht Passau HRB 6032
>Mitglieder des Vorstands: Sven Anders, Marcus Junker, Michael Schön
>Vorsitzender des Aufsichtsrats: Dipl. Kfm. Karlheinz Antesberger
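
On the quoted question about a better test: one possible variant of the
copy loop above, sketched here as an assumption rather than anything
proposed in the thread, uses random data and compares checksums after
each round, so silent corruption shows up even before the filesystem
reports an error. The function name copy_verify and its three
arguments are hypothetical.

```shell
#!/bin/sh
# Sketch: copy-loop stress test that verifies the data with checksums.
# Arguments (hypothetical): directory on the array, file size in MB, rounds.
copy_verify() {
    dir=$1 size_mb=$2 rounds=$3
    # Random data makes misplaced or stale blocks visible; zeros can hide them.
    dd if=/dev/urandom of="$dir/test0" bs=1M count="$size_mb" 2>/dev/null || return 2
    want=$(cksum < "$dir/test0")
    i=0
    while [ "$i" -lt "$rounds" ]; do
        # Same copy chain as the original test case.
        cp "$dir/test0" "$dir/test1" && cp "$dir/test1" "$dir/test2" &&
            cp "$dir/test2" "$dir/test0" || return 2
        got=$(cksum < "$dir/test0")
        if [ "$got" != "$want" ]; then
            echo "checksum mismatch in round $i"
            return 1
        fi
        i=$((i + 1))
    done
    echo "ok: $rounds rounds"
}
```

For example, `copy_verify /mnt/raid 300 100` would repeat the original
300 MB test for 100 rounds, where /mnt/raid stands for wherever the md
device is mounted. Reads may be served from the page cache, so the file
should be larger than RAM if the disks themselves are to be exercised.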

Thread overview: 6+ messages
2005-01-05 20:43 Linux 2.6.10 / RAID1 problem Sven Anders
2005-01-05 22:35 ` Neil Brown
2005-01-06 12:01   ` Sven Anders
2005-02-14 20:02     ` Linux 2.6.10 / RAID1 problem (SOLVED) Sven Anders
2005-01-05 22:46 ` bernd [this message]
2005-01-05 22:55   ` Linux 2.6.10 / RAID1 problem Neil Brown
