filesystem corruption with md raid6

All of lore.kernel.org
 help / color / mirror / Atom feed

* filesystem corruption with md raid6
@ 2007-04-26 18:27 Clem Pryke
  2007-04-26 20:17 ` James Bottomley
  0 siblings, 1 reply; 5+ messages in thread
From: Clem Pryke @ 2007-04-26 18:27 UTC (permalink / raw)
  To: linux-scsi; +Cc: pryke

I have a system with 12 SATA disks attached via SAS. When copying into the 
array during re-sync I get filesystem errors and corruption for raid6 but not 
for raid5. This problem is repeatable. I actually have 2 separate 12 disk 
arrays and get the same behavior on both.

Does this sound familiar to anyone?

Here's a little more detail:

- 8 core AMD64 system running RHEL4U4 kernel 2.6.9-42.0.3.ELsmp
- 12 Seagate ST3750640NS disks via LSI SAS1068 card and mptsas driver provided 
with kernel. Disk chassis is Promise VTrak J300s
- build raid5/6 arrays using mdadm in the normal way
- build filesystem using e2fs in the normal way
- mount the array
- fail out and re-add a disk using "mdadm -f/-r/-a /dev/sdd1"
- while re-sync in progress start an rsync to copy large amount of data into 
the array

For raid 5 array while rsync is running re-sync slows down enormously (as 
shown in /proc/mdstat), then speeds up again once rsync complete and all is 
good. Unmounting the array and running e2fsck shows no errors.

For raid 6 re-sync slows down once rsync starts, but then after a few minutes 
I see the errors below and rsync stops. In this case unmounting and running 
e2fsck shows loads of errors.

Apr 26 11:30:50 spt kernel: attempt to access beyond end of device
Apr 26 11:30:50 spt kernel: md0: rw=1, want=14801215240, limit=14651489280
Apr 26 11:30:50 spt kernel: Aborting journal on device md0.
Apr 26 11:30:50 spt kernel: ext3_abort called.
Apr 26 11:30:50 spt kernel: EXT3-fs error (device md0): ext3_journal_start_sb: 
Detected aborted journal
Apr 26 11:30:50 spt kernel: Remounting filesystem read-only
Apr 26 11:30:50 spt kernel: EXT3-fs error (device md0) in start_transaction: 
Journal has aborted

-- 
**********************************************************************
Clem Pryke - Assistant Professor - Astronomy and Astrophysics
University of Chicago,
Room 120, LASR, 933 East 56th Street, Chicago, Illinois 60637, USA
Tel: 773 702-7853  Fax: 773 702-6645  email: pryke@focus.uchicago.edu
**********************************************************************

^ permalink raw reply	[flat|nested] 5+ messages in thread

* filesystem corruption with md raid6
@ 2007-04-26 19:26 Clem Pryke
  2007-04-27  6:24 ` Neil Brown
  0 siblings, 1 reply; 5+ messages in thread
From: Clem Pryke @ 2007-04-26 19:26 UTC (permalink / raw)
  To: linux-raid; +Cc: pryke

 have a system with 12 SATA disks attached via SAS. When copying into the 
array during re-sync I get filesystem errors and corruption for raid6 but not 
for raid5. This problem is repeatable. I actually have 2 separate 12 disk 
arrays and get the same behavior on both.

Does this sound familiar to anyone?

Here's a little more detail:

- 8 core AMD64 system running RHEL4U4 kernel 2.6.9-42.0.3.ELsmp
- 12 Seagate ST3750640NS disks via LSI SAS1068 card and mptsas driver provided 
with kernel. Disk chassis is Promise VTrak J300s
- build raid5/6 arrays using mdadm in the normal way
- build filesystem using e2fs in the normal way
- mount the array
- fail out and re-add a disk using "mdadm -f/-r/-a /dev/sdd1"
- while re-sync in progress start an rsync to copy large amount of data into 
the array

For raid 5 array while rsync is running re-sync slows down enormously (as 
shown in /proc/mdstat), then speeds up again once rsync complete and all is 
good. Unmounting the array and running e2fsck shows no errors.

For raid 6 re-sync slows down once rsync starts, but then after a few minutes 
I see the errors below and rsync stops. In this case unmounting and running 
e2fsck shows loads of errors.

Apr 26 11:30:50 spt kernel: attempt to access beyond end of device
Apr 26 11:30:50 spt kernel: md0: rw=1, want=14801215240, limit=14651489280
Apr 26 11:30:50 spt kernel: Aborting journal on device md0.
Apr 26 11:30:50 spt kernel: ext3_abort called.
Apr 26 11:30:50 spt kernel: EXT3-fs error (device md0): ext3_journal_start_sb: 
Detected aborted journal
Apr 26 11:30:50 spt kernel: Remounting filesystem read-only
Apr 26 11:30:50 spt kernel: EXT3-fs error (device md0) in start_transaction: 
Journal has aborted

-- 
**********************************************************************
Clem Pryke - Assistant Professor - Astronomy and Astrophysics
University of Chicago,
Room 120, LASR, 933 East 56th Street, Chicago, Illinois 60637, USA
Tel: 773 702-7853  Fax: 773 702-6645  email: pryke@focus.uchicago.edu
**********************************************************************

^ permalink raw reply	[flat|nested] 5+ messages in thread

* Re: filesystem corruption with md raid6
  2007-04-26 18:27 Clem Pryke
@ 2007-04-26 20:17 ` James Bottomley
  0 siblings, 0 replies; 5+ messages in thread
From: James Bottomley @ 2007-04-26 20:17 UTC (permalink / raw)
  To: Clem Pryke; +Cc: linux-scsi

On Thu, 2007-04-26 at 13:27 -0500, Clem Pryke wrote:
> I have a system with 12 SATA disks attached via SAS. When copying into the 
> array during re-sync I get filesystem errors and corruption for raid6 but not 
> for raid5. This problem is repeatable. I actually have 2 separate 12 disk 
> arrays and get the same behavior on both.

If the data is sound for raid5 but not raid6, it tends to indicate that
it might be a md issue; you should perhaps crosspost to
linux-raid@vger.kernel.org which is where the md people hang out.

> Does this sound familiar to anyone?
> 
> Here's a little more detail:
> 
> - 8 core AMD64 system running RHEL4U4 kernel 2.6.9-42.0.3.ELsmp

This is a vendor supported kernel ... perhaps you should start first
with them ... most linux lists are going to want you to try and
reproduce this with the latest kernel ...

> - 12 Seagate ST3750640NS disks via LSI SAS1068 card and mptsas driver provided 
> with kernel. Disk chassis is Promise VTrak J300s
> - build raid5/6 arrays using mdadm in the normal way
> - build filesystem using e2fs in the normal way
> - mount the array
> - fail out and re-add a disk using "mdadm -f/-r/-a /dev/sdd1"
> - while re-sync in progress start an rsync to copy large amount of data into 
> the array
> 
> For raid 5 array while rsync is running re-sync slows down enormously (as 
> shown in /proc/mdstat), then speeds up again once rsync complete and all is 
> good. Unmounting the array and running e2fsck shows no errors.
> 
> For raid 6 re-sync slows down once rsync starts, but then after a few minutes 
> I see the errors below and rsync stops. In this case unmounting and running 
> e2fsck shows loads of errors.
> 
> Apr 26 11:30:50 spt kernel: attempt to access beyond end of device
> Apr 26 11:30:50 spt kernel: md0: rw=1, want=14801215240, limit=14651489280
> Apr 26 11:30:50 spt kernel: Aborting journal on device md0.
> Apr 26 11:30:50 spt kernel: ext3_abort called.
> Apr 26 11:30:50 spt kernel: EXT3-fs error (device md0): ext3_journal_start_sb: 
> Detected aborted journal
> Apr 26 11:30:50 spt kernel: Remounting filesystem read-only
> Apr 26 11:30:50 spt kernel: EXT3-fs error (device md0) in start_transaction: 
> Journal has aborted

This definitely indicates some type of corruption.  I assume a corrupt
inode value caused a read beyond the limits of the device ...
unfortunately, with no other kernel messages, it's difficult to tell
whether it was MD, SCSI or the device driver (or even the device) that
caused it.  Like I said, if it doesn't occur with RAID5, my money would
be on MD.

James



^ permalink raw reply	[flat|nested] 5+ messages in thread

* Re: filesystem corruption with md raid6
  2007-04-26 19:26 filesystem corruption with md raid6 Clem Pryke
@ 2007-04-27  6:24 ` Neil Brown
  2007-04-27 18:41   ` Bill Davidsen
  0 siblings, 1 reply; 5+ messages in thread
From: Neil Brown @ 2007-04-27  6:24 UTC (permalink / raw)
  To: Clem Pryke; +Cc: linux-raid

On Thursday April 26, pryke@focus.uchicago.edu wrote:
>  have a system with 12 SATA disks attached via SAS. When copying into the 
> array during re-sync I get filesystem errors and corruption for raid6 but not 
> for raid5. This problem is repeatable. I actually have 2 separate 12 disk 
> arrays and get the same behavior on both.
> 
> Does this sound familiar to anyone?

Yes.

> 
> Here's a little more detail:
> 
> - 8 core AMD64 system running RHEL4U4 kernel 2.6.9-42.0.3.ELsmp
                                               ^^^^^

2.6.9 is very old.  If you want support for RHEL, you should really be
asking RedHat.

Tell them it might be fixed by 
   http://linux.bkbits.net:8080/linux-2.6/?PAGE=gnupatch&REV=1.1938.340.65

(I'm not sure how stable that URL is ... maybe take a copy now. it
should start
#### ChangeSet ####
2004-11-11 13:48:33-08:00, neilb@cse.unsw.edu.au 
  [PATCH] md: fix raid6 problem

  Sometimes it didn't read all (working) drives before a parity calculation.
)

but I cannot be certain (it was a long time ago).
That patch came out after 2.6.9 but about 2 months.
So another option is to build a more recent kernel.

NeilBrown

^ permalink raw reply	[flat|nested] 5+ messages in thread

* Re: filesystem corruption with md raid6
  2007-04-27  6:24 ` Neil Brown
@ 2007-04-27 18:41   ` Bill Davidsen
  0 siblings, 0 replies; 5+ messages in thread
From: Bill Davidsen @ 2007-04-27 18:41 UTC (permalink / raw)
  To: Neil Brown; +Cc: Clem Pryke, linux-raid

Neil Brown wrote:
> On Thursday April 26, pryke@focus.uchicago.edu wrote:
>   
>>  have a system with 12 SATA disks attached via SAS. When copying into the 
>> array during re-sync I get filesystem errors and corruption for raid6 but not 
>> for raid5. This problem is repeatable. I actually have 2 separate 12 disk 
>> arrays and get the same behavior on both.
>>
>> Does this sound familiar to anyone?
>>     
>
> Yes.
>
>   
>> Here's a little more detail:
>>
>> - 8 core AMD64 system running RHEL4U4 kernel 2.6.9-42.0.3.ELsmp
>>     
>                                                ^^^^^
>
> 2.6.9 is very old.  If you want support for RHEL, you should really be
> asking RedHat.
>
> Tell them it might be fixed by 
>    http://linux.bkbits.net:8080/linux-2.6/?PAGE=gnupatch&REV=1.1938.340.65
>
> (I'm not sure how stable that URL is ... maybe take a copy now. it
> should start
> #### ChangeSet ####
> 2004-11-11 13:48:33-08:00, neilb@cse.unsw.edu.au 
>   [PATCH] md: fix raid6 problem
>   
>   Sometimes it didn't read all (working) drives before a parity calculation.
> )
>
> but I cannot be certain (it was a long time ago).
> That patch came out after 2.6.9 but about 2 months.
> So another option is to build a more recent kernel.
>   

Generally if you run RHEL you don't build a newer kernel. Both because 
there are often stability issues being addresses by using RHEL, and 
because RH kernels are often patched in odd ways compared to mainline 
and the system may expect subtly different behavior from the kernel.

For what RH charges, send them a copy of Neil's possible issue and let 
them fix it.

Oh, and dropping a lot of data into a rebuilding array is generally not 
desirable, often the total time will exceed rebuild and update as serial 
operations. When it comes to write performance RAID6 is good for 
reliability. ;-)

-- 
bill davidsen <davidsen@tmr.com>
  CTO TMR Associates, Inc
  Doing interesting things with small computers since 1979


^ permalink raw reply	[flat|nested] 5+ messages in thread

end of thread, other threads:[~2007-04-27 18:41 UTC | newest]

Thread overview: 5+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2007-04-26 19:26 filesystem corruption with md raid6 Clem Pryke
2007-04-27  6:24 ` Neil Brown
2007-04-27 18:41   ` Bill Davidsen
  -- strict thread matches above, loose matches on Subject: below --
2007-04-26 18:27 Clem Pryke
2007-04-26 20:17 ` James Bottomley

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.