linux-raid.vger.kernel.org archive mirror
 help / color / mirror / Atom feed
* resync duration ?
@ 2009-05-14 13:06 Rainer Fuegenstein
  2009-05-14 13:58 ` Bryan Mesich
                   ` (2 more replies)
  0 siblings, 3 replies; 19+ messages in thread
From: Rainer Fuegenstein @ 2009-05-14 13:06 UTC (permalink / raw)
  To: linux-raid

Hi guys,

I'm a but confused about the duration of a raid5 resync:

- occasionally, my server with 4*750 gb sata raid5 crashed (because of
problems with the power supply); after rebooting it took about 10 to 12
hours to resync the raid5 (guess it just re-created some parity
information or however it works internally, but didn't have to copy
any data)

- right now I replaced one 750GB disk with a 1.5TB disk, but now
resyncing (according to /proc/mdstat) it is supposed to take only 160
minutes ?! although it needs top copy data to a blank disk ?

is this normal or should I be worried, especially before I pull out
the next 750GB disk and replace it with the next 1.5TB disk ?

tnx in advance.


^ permalink raw reply	[flat|nested] 19+ messages in thread

* Re: resync duration ?
  2009-05-14 13:06 resync duration ? Rainer Fuegenstein
@ 2009-05-14 13:58 ` Bryan Mesich
  2009-05-14 16:26   ` Ryan Wagoner
  2009-05-14 17:06   ` John Robinson
  2009-05-15  1:22 ` Neil Brown
  2009-05-18 15:02 ` Bill Davidsen
  2 siblings, 2 replies; 19+ messages in thread
From: Bryan Mesich @ 2009-05-14 13:58 UTC (permalink / raw)
  To: Rainer Fuegenstein; +Cc: linux-raid

[-- Attachment #1: Type: text/plain, Size: 2032 bytes --]

On Thu, May 14, 2009 at 03:06:47PM +0200, Rainer Fuegenstein wrote:
> Hi guys,
> 
> I'm a but confused about the duration of a raid5 resync:
> 
> - occasionally, my server with 4*750 gb sata raid5 crashed (because of
> problems with the power supply); after rebooting it took about 10 to 12
> hours to resync the raid5 (guess it just re-created some parity
> information or however it works internally, but didn't have to copy
> any data)
> 
> - right now I replaced one 750GB disk with a 1.5TB disk, but now
> resyncing (according to /proc/mdstat) it is supposed to take only 160
> minutes ?! although it needs top copy data to a blank disk ?

Just a guess...but your problem might be an artifact of a bug
that has been recently fixed.  Neil sent out mail on 2009-05-04
with the fix I'm thinking about.  Here is an excerpt from his
mail:

Subject: [md PATCH 4/7] md: tidy up status_resync to handle
large arrays.

Two problems in status_resync.
1. It still used Kilobytes as the basic block unit, while most
   code now uses sectors uniformly.
2. It doesn't allow for the possibility that max_sectors exceeds
   the range of "unsigned long".

So
 - change "max_blocks" to "max_sectors", and store sector numbers
   in there and in 'resync'
 - Make 'rt' a 'sector_t' so it can temporarily hold the number
   of remaining sectors.
 - use sector_div rather than normal division.
 - change the magic '100' used to preserve precision to '32'.
   + making it a power of 2 makes division easier
   + it doesn't need to be as large as it was chosen when we
   averaged speed over the entire run.  Now we average speed over the
   last 30 seconds or so.

> is this normal or should I be worried, especially before I pull out
> the next 750GB disk and replace it with the next 1.5TB disk ?

If your sync only takes 160 minutes...then I'd start to poke
around.  If its finishing time is reasonable considering the
array size, then I'd continue with the migration.
 
> tnx in advance.

Bryan

[-- Attachment #2: Type: application/pgp-signature, Size: 189 bytes --]

^ permalink raw reply	[flat|nested] 19+ messages in thread

* Re: resync duration ?
  2009-05-14 13:58 ` Bryan Mesich
@ 2009-05-14 16:26   ` Ryan Wagoner
  2009-05-14 17:03     ` Re[2]: " Rainer Fuegenstein
  2009-05-14 17:06   ` John Robinson
  1 sibling, 1 reply; 19+ messages in thread
From: Ryan Wagoner @ 2009-05-14 16:26 UTC (permalink / raw)
  To: Bryan Mesich, Rainer Fuegenstein, linux-raid

Sounds like it could be wrapping around as suggested. Since you are
swapping out all the disks have you thought about stopping the array
and using dd to copy the old disk to the new disk. Then there is no
resync period or degraded array.

Ryan

On Thu, May 14, 2009 at 9:58 AM, Bryan Mesich <bryan.mesich@ndsu.edu> wrote:
> On Thu, May 14, 2009 at 03:06:47PM +0200, Rainer Fuegenstein wrote:
>> Hi guys,
>>
>> I'm a but confused about the duration of a raid5 resync:
>>
>> - occasionally, my server with 4*750 gb sata raid5 crashed (because of
>> problems with the power supply); after rebooting it took about 10 to 12
>> hours to resync the raid5 (guess it just re-created some parity
>> information or however it works internally, but didn't have to copy
>> any data)
>>
>> - right now I replaced one 750GB disk with a 1.5TB disk, but now
>> resyncing (according to /proc/mdstat) it is supposed to take only 160
>> minutes ?! although it needs top copy data to a blank disk ?
>
> Just a guess...but your problem might be an artifact of a bug
> that has been recently fixed.  Neil sent out mail on 2009-05-04
> with the fix I'm thinking about.  Here is an excerpt from his
> mail:
>
> Subject: [md PATCH 4/7] md: tidy up status_resync to handle
> large arrays.
>
> Two problems in status_resync.
> 1. It still used Kilobytes as the basic block unit, while most
>   code now uses sectors uniformly.
> 2. It doesn't allow for the possibility that max_sectors exceeds
>   the range of "unsigned long".
>
> So
>  - change "max_blocks" to "max_sectors", and store sector numbers
>   in there and in 'resync'
>  - Make 'rt' a 'sector_t' so it can temporarily hold the number
>   of remaining sectors.
>  - use sector_div rather than normal division.
>  - change the magic '100' used to preserve precision to '32'.
>   + making it a power of 2 makes division easier
>   + it doesn't need to be as large as it was chosen when we
>   averaged speed over the entire run.  Now we average speed over the
>   last 30 seconds or so.
>
>> is this normal or should I be worried, especially before I pull out
>> the next 750GB disk and replace it with the next 1.5TB disk ?
>
> If your sync only takes 160 minutes...then I'd start to poke
> around.  If its finishing time is reasonable considering the
> array size, then I'd continue with the migration.
>
>> tnx in advance.
>
> Bryan
>
--
To unsubscribe from this list: send the line "unsubscribe linux-raid" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html

^ permalink raw reply	[flat|nested] 19+ messages in thread

* Re[2]: resync duration ?
  2009-05-14 16:26   ` Ryan Wagoner
@ 2009-05-14 17:03     ` Rainer Fuegenstein
  0 siblings, 0 replies; 19+ messages in thread
From: Rainer Fuegenstein @ 2009-05-14 17:03 UTC (permalink / raw)
  To: Ryan Wagoner; +Cc: Bryan Mesich, linux-raid

guys,

It's amazing - it really finished resyncing after about 160 minutes.
cpu load went down to 0.1 (from about 2.70) and there was nearly no
disk activity. looks like it was really that fast and not just
displaying a false estimate. (and yes, the xfs file system on top if
it seems to be intact, at least for now).

RW> Sounds like it could be wrapping around as suggested. Since you are
RW> swapping out all the disks have you thought about stopping the array
RW> and using dd to copy the old disk to the new disk. Then there is no
RW> resync period or degraded array.

I'm not feeling comfortable with dd'ing from/to disks of different
sizes/layouts. and I'd like to avoid long downtimes of the server.
Originally I planned to add 2 2port sata controllers and install the
new disks in parallel (as suggested by chris), but that didn't
work out so well.

anyway, over night I'll backup the most important data to the disk I
just pulled out (and will do so with the other disks as I pull them
out). will replace the second disk tomorrow.

I'll let you know of success or failure as the story progresses.


RW> Ryan

RW> On Thu, May 14, 2009 at 9:58 AM, Bryan Mesich <bryan.mesich@ndsu.edu> wrote:
>> On Thu, May 14, 2009 at 03:06:47PM +0200, Rainer Fuegenstein wrote:
>>> Hi guys,
>>>
>>> I'm a but confused about the duration of a raid5 resync:
>>>
>>> - occasionally, my server with 4*750 gb sata raid5 crashed (because of
>>> problems with the power supply); after rebooting it took about 10 to 12
>>> hours to resync the raid5 (guess it just re-created some parity
>>> information or however it works internally, but didn't have to copy
>>> any data)
>>>
>>> - right now I replaced one 750GB disk with a 1.5TB disk, but now
>>> resyncing (according to /proc/mdstat) it is supposed to take only 160
>>> minutes ?! although it needs top copy data to a blank disk ?
>>
>> Just a guess...but your problem might be an artifact of a bug
>> that has been recently fixed.  Neil sent out mail on 2009-05-04
>> with the fix I'm thinking about.  Here is an excerpt from his
>> mail:
>>
>> Subject: [md PATCH 4/7] md: tidy up status_resync to handle
>> large arrays.
>>
>> Two problems in status_resync.
>> 1. It still used Kilobytes as the basic block unit, while most
>>   code now uses sectors uniformly.
>> 2. It doesn't allow for the possibility that max_sectors exceeds
>>   the range of "unsigned long".
>>
>> So
>>  - change "max_blocks" to "max_sectors", and store sector numbers
>>   in there and in 'resync'
>>  - Make 'rt' a 'sector_t' so it can temporarily hold the number
>>   of remaining sectors.
>>  - use sector_div rather than normal division.
>>  - change the magic '100' used to preserve precision to '32'.
>>   + making it a power of 2 makes division easier
>>   + it doesn't need to be as large as it was chosen when we
>>   averaged speed over the entire run.  Now we average speed over the
>>   last 30 seconds or so.
>>
>>> is this normal or should I be worried, especially before I pull out
>>> the next 750GB disk and replace it with the next 1.5TB disk ?
>>
>> If your sync only takes 160 minutes...then I'd start to poke
>> around.  If its finishing time is reasonable considering the
>> array size, then I'd continue with the migration.
>>
>>> tnx in advance.
>>
>> Bryan
>>
RW> --
RW> To unsubscribe from this list: send the line "unsubscribe linux-raid" in
RW> the body of a message to majordomo@vger.kernel.org
RW> More majordomo info at  http://vger.kernel.org/majordomo-info.html


------------------------------------------------------------------------------
Unix gives you just enough rope to hang yourself -- and then a couple of more 
feet, just to be sure.
(Eric Allman)
------------------------------------------------------------------------------

--
To unsubscribe from this list: send the line "unsubscribe linux-raid" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html

^ permalink raw reply	[flat|nested] 19+ messages in thread

* Re: resync duration ?
  2009-05-14 13:58 ` Bryan Mesich
  2009-05-14 16:26   ` Ryan Wagoner
@ 2009-05-14 17:06   ` John Robinson
  2009-05-14 18:07     ` Bryan Mesich
  1 sibling, 1 reply; 19+ messages in thread
From: John Robinson @ 2009-05-14 17:06 UTC (permalink / raw)
  To: Linux RAID, Rainer Fuegenstein

On 14/05/2009 14:58, Bryan Mesich wrote:
[...]
> If your sync only takes 160 minutes...then I'd start to poke
> around.  If its finishing time is reasonable considering the
> array size, then I'd continue with the migration.

I think about 160 minutes is reasonable for 750GB, on an otherwise 
unloaded box; it's only 70MB/sec off 3 drives and on to 1 newer, faster 
drive.

Cheers,

John.


^ permalink raw reply	[flat|nested] 19+ messages in thread

* Re: resync duration ?
  2009-05-14 17:06   ` John Robinson
@ 2009-05-14 18:07     ` Bryan Mesich
  2009-05-14 19:38       ` Richard Scobie
  0 siblings, 1 reply; 19+ messages in thread
From: Bryan Mesich @ 2009-05-14 18:07 UTC (permalink / raw)
  To: John Robinson; +Cc: Linux RAID, Rainer Fuegenstein

[-- Attachment #1: Type: text/plain, Size: 795 bytes --]

On Thu, May 14, 2009 at 06:06:51PM +0100, John Robinson wrote:
> I think about 160 minutes is reasonable for 750GB, on an otherwise  
> unloaded box; it's only 70MB/sec off 3 drives and on to 1 newer, faster  
> drive.

I have a similar setup with 4 750G SATA drives.  The box is
failry beefy, with 8 processor cores and a PCI-E SATA controler
(3Ware 9650SE).  For an initial sync, it will take about 315
minutes to complete.

I'm not sure if this is a fair comparison, since Rainer was do a
resync.

md8 : active raid5 sdx1[4] sdw1[2] sdv1[1] sdu1[0]
      2197209600 blocks level 5, 64k chunk, algorithm 2 [4/3] [UUU_]
      [>....................]  recovery =  1.6% (12015536/732403200) finish=312.3min speed=38437K/sec

I'm probably splitting hairs at this point.

Bryan

[-- Attachment #2: Type: application/pgp-signature, Size: 189 bytes --]

^ permalink raw reply	[flat|nested] 19+ messages in thread

* Re: resync duration ?
  2009-05-14 18:07     ` Bryan Mesich
@ 2009-05-14 19:38       ` Richard Scobie
  0 siblings, 0 replies; 19+ messages in thread
From: Richard Scobie @ 2009-05-14 19:38 UTC (permalink / raw)
  To: Linux RAID; +Cc: Rainer Fuegenstein

Bryan Mesich wrote:
> On Thu, May 14, 2009 at 06:06:51PM +0100, John Robinson wrote:
> 
>>I think about 160 minutes is reasonable for 750GB, on an otherwise  
>>unloaded box; it's only 70MB/sec off 3 drives and on to 1 newer, faster  
>>drive.
> 
> 
> I have a similar setup with 4 750G SATA drives.  The box is
> failry beefy, with 8 processor cores and a PCI-E SATA controler
> (3Ware 9650SE).  For an initial sync, it will take about 315
> minutes to complete.
> 
> I'm not sure if this is a fair comparison, since Rainer was do a
> resync.
> 
> md8 : active raid5 sdx1[4] sdw1[2] sdv1[1] sdu1[0]
>       2197209600 blocks level 5, 64k chunk, algorithm 2 [4/3] [UUU_]
>       [>....................]  recovery =  1.6% (12015536/732403200) finish=312.3min speed=38437K/sec
> 
> I'm probably splitting hairs at this point.
> 
> Bryan

Just to add another data point, I replaced a dead 750GB drive in 4 x 750 
md RAID5 last week and it took 145 minutes.

Resync on another 16 x 750GB md RAID6 takes around 180min.

Regards,

Richard

^ permalink raw reply	[flat|nested] 19+ messages in thread

* Re: resync duration ?
  2009-05-14 13:06 resync duration ? Rainer Fuegenstein
  2009-05-14 13:58 ` Bryan Mesich
@ 2009-05-15  1:22 ` Neil Brown
  2009-05-15  4:04   ` Leslie Rhorer
  2009-05-15 11:16   ` Rainer Fuegenstein
  2009-05-18 15:02 ` Bill Davidsen
  2 siblings, 2 replies; 19+ messages in thread
From: Neil Brown @ 2009-05-15  1:22 UTC (permalink / raw)
  To: Rainer Fuegenstein; +Cc: linux-raid

On Thursday May 14, rfu@kaneda.iguw.tuwien.ac.at wrote:
> Hi guys,
> 
> I'm a but confused about the duration of a raid5 resync:
> 
> - occasionally, my server with 4*750 gb sata raid5 crashed (because of
> problems with the power supply); after rebooting it took about 10 to 12
> hours to resync the raid5 (guess it just re-created some parity
> information or however it works internally, but didn't have to copy
> any data)

A resync reads all the data and checks the parity.  If it find bad
parity (which is likely to be very rare) then it corrects it.
So resync should run at the full speed of the hardware, just reading.

> 
> - right now I replaced one 750GB disk with a 1.5TB disk, but now
> resyncing (according to /proc/mdstat) it is supposed to take only 160
> minutes ?! although it needs top copy data to a blank disk ?
> 
> is this normal or should I be worried, especially before I pull out
> the next 750GB disk and replace it with the next 1.5TB disk ?

recovery reads N-1 drives and writes the N'th drive.  So it shuffles
exactly the same amount of data as sync, and performs just as many IO
operations, though some of them are writes rather than reads.  I
believe that (on rotating drives) writes go at much the same speed as
reads.
So you would expect this to take exactly as long as the resync.

As others have said, 3 hours it a reasonable ball-part for 750Gig.
10-12 hours is definitely longer than I would expect.

Of course, if use are using the array then resync (or recovery) will
back off and let the regular IO have priority.  So if you were
actively using the array during the 10-12 hours resync, then it makes
perfect sense.

NeilBrown

^ permalink raw reply	[flat|nested] 19+ messages in thread

* RE: resync duration ?
  2009-05-15  1:22 ` Neil Brown
@ 2009-05-15  4:04   ` Leslie Rhorer
  2009-05-15  4:44     ` NeilBrown
  2009-05-15 11:16   ` Rainer Fuegenstein
  1 sibling, 1 reply; 19+ messages in thread
From: Leslie Rhorer @ 2009-05-15  4:04 UTC (permalink / raw)
  To: 'Linux RAID'

> > I'm a but confused about the duration of a raid5 resync:
> >
> > - occasionally, my server with 4*750 gb sata raid5 crashed (because of
> > problems with the power supply); after rebooting it took about 10 to 12
> > hours to resync the raid5 (guess it just re-created some parity
> > information or however it works internally, but didn't have to copy
> > any data)
> 
> A resync reads all the data and checks the parity.  If it find bad
> parity (which is likely to be very rare) then it corrects it.
> So resync should run at the full speed of the hardware, just reading.

It will?  Maybe I am just mis-remembering, but I thought I saw the same note
in the log when the resync occurred (ckarray) as when a recovery was in
progress:

May 14 22:40:42 Backup kernel: [ 8083.747787] md: recovery of RAID array md0
May 14 22:40:42 Backup kernel: [ 8083.747789] md: minimum _guaranteed_
speed: 1000 KB/sec/disk.
May 14 22:40:42 Backup kernel: [ 8083.747791] md: using maximum available
idle IO bandwidth (but not more than 200000 KB/sec) for recovery.
May 14 22:40:42 Backup kernel: [ 8083.747797] md: using 128k window, over a
total of 1465138432 blocks.

Am I mis-remembering, or am I missing something here?

Oh, while I am thinking of it, ckarray runs every month on the first of the
month, but how can one trigger it manually?


^ permalink raw reply	[flat|nested] 19+ messages in thread

* RE: resync duration ?
  2009-05-15  4:04   ` Leslie Rhorer
@ 2009-05-15  4:44     ` NeilBrown
  2009-05-15  5:07       ` Leslie Rhorer
  0 siblings, 1 reply; 19+ messages in thread
From: NeilBrown @ 2009-05-15  4:44 UTC (permalink / raw)
  To: lrhorer; +Cc: 'Linux RAID'

On Fri, May 15, 2009 2:04 pm, Leslie Rhorer wrote:
>> > I'm a but confused about the duration of a raid5 resync:
>> >
>> > - occasionally, my server with 4*750 gb sata raid5 crashed (because of
>> > problems with the power supply); after rebooting it took about 10 to
>> 12
>> > hours to resync the raid5 (guess it just re-created some parity
>> > information or however it works internally, but didn't have to copy
>> > any data)
>>
>> A resync reads all the data and checks the parity.  If it find bad
>> parity (which is likely to be very rare) then it corrects it.
>> So resync should run at the full speed of the hardware, just reading.
>
> It will?  Maybe I am just mis-remembering, but I thought I saw the same
> note
> in the log when the resync occurred (ckarray) as when a recovery was in
> progress:
>
> May 14 22:40:42 Backup kernel: [ 8083.747787] md: recovery of RAID array
> md0
> May 14 22:40:42 Backup kernel: [ 8083.747789] md: minimum _guaranteed_
> speed: 1000 KB/sec/disk.
> May 14 22:40:42 Backup kernel: [ 8083.747791] md: using maximum available
> idle IO bandwidth (but not more than 200000 KB/sec) for recovery.
> May 14 22:40:42 Backup kernel: [ 8083.747797] md: using 128k window, over
> a
> total of 1465138432 blocks.
>
> Am I mis-remembering, or am I missing something here?

It isn't clear to me exactly what you are asking...

It says it won't use more than 200MB/sec, which is true. But that is
per-device, and few devices can deliver that yet.

It says "using maximum available", so it if you want the bandwidth
for something else, it will back off.

It prints the same message for resync as for recovery.



>
> Oh, while I am thinking of it, ckarray runs every month on the first of
> the
> month, but how can one trigger it manually?

  echo check > /sys/block/mdX/md/sync_action.

or "echo repair" to get it to fix anything it finds.

NeilBrown

>
> --
> To unsubscribe from this list: send the line "unsubscribe linux-raid" in
> the body of a message to majordomo@vger.kernel.org
> More majordomo info at  http://vger.kernel.org/majordomo-info.html
>


^ permalink raw reply	[flat|nested] 19+ messages in thread

* RE: resync duration ?
  2009-05-15  4:44     ` NeilBrown
@ 2009-05-15  5:07       ` Leslie Rhorer
  2009-05-15 20:21         ` Richard Scobie
  2009-05-15 21:51         ` Neil Brown
  0 siblings, 2 replies; 19+ messages in thread
From: Leslie Rhorer @ 2009-05-15  5:07 UTC (permalink / raw)
  To: 'Linux RAID'

> >> A resync reads all the data and checks the parity.  If it find bad
> >> parity (which is likely to be very rare) then it corrects it.
> >> So resync should run at the full speed of the hardware, just reading.
> >
> > It will?  Maybe I am just mis-remembering, but I thought I saw the same
> > note
> > in the log when the resync occurred (ckarray) as when a recovery was in
> > progress:
> >
> > May 14 22:40:42 Backup kernel: [ 8083.747787] md: recovery of RAID array
> > md0
> > May 14 22:40:42 Backup kernel: [ 8083.747789] md: minimum _guaranteed_
> > speed: 1000 KB/sec/disk.
> > May 14 22:40:42 Backup kernel: [ 8083.747791] md: using maximum
> available
> > idle IO bandwidth (but not more than 200000 KB/sec) for recovery.
> > May 14 22:40:42 Backup kernel: [ 8083.747797] md: using 128k window,
> over
> > a
> > total of 1465138432 blocks.
> >
> > Am I mis-remembering, or am I missing something here?
> 
> It isn't clear to me exactly what you are asking...
> 
> It says it won't use more than 200MB/sec, which is true. But that is
> per-device, and few devices can deliver that yet.

Oh.  I thought it meant for the entire array.  Yeah, 200MBps is pretty far
out there, at least for a sustained rate.  SATA II can burst to that, but
few drives can manage much more than 50 MBps, sustained.

> It says "using maximum available", so it if you want the bandwidth
> for something else, it will back off.
> 
> It prints the same message for resync as for recovery.

Ah, so I'm not going crazy, I just have a comprehension problem.

Hmmm.  I'm not sure which is worse. :-)

> > Oh, while I am thinking of it, ckarray runs every month on the first of
> > the
> > month, but how can one trigger it manually?
> 
>   echo check > /sys/block/mdX/md/sync_action.
> 
> or "echo repair" to get it to fix anything it finds.

Thanks.  Those are pretty easy, but it would be nice if they were in the MAN
page.  Better yet, have you considered adding both as command line options
for mdadm?


^ permalink raw reply	[flat|nested] 19+ messages in thread

* Re: resync duration ?
  2009-05-15  1:22 ` Neil Brown
  2009-05-15  4:04   ` Leslie Rhorer
@ 2009-05-15 11:16   ` Rainer Fuegenstein
  1 sibling, 0 replies; 19+ messages in thread
From: Rainer Fuegenstein @ 2009-05-15 11:16 UTC (permalink / raw)
  To: Neil Brown; +Cc: linux-raid


> As others have said, 3 hours it a reasonable ball-part for 750Gig.
> 10-12 hours is definitely longer than I would expect.
>
> Of course, if use are using the array then resync (or recovery) will
> back off and let the regular IO have priority.  So if you were
> actively using the array during the 10-12 hours resync, then it makes
> perfect sense.

That's what puzzles me: the server is used just for storage and usually
there are nearly no accesses during the 10-12h resync period. whereas
(according to your statement above) when accessing the raid with the one
new 1.5TB disk during the 3h resync period, the estimate climbs to about
1600 minutes, but drops back to 3 hours when idle again.

could this be a result of the hardware configuration ?

old 750GB disk:
  Vendor: ATA       Model: SAMSUNG HD753LJ   Rev: 1AA0
  Type:   Direct-Access                      ANSI SCSI revision: 05

new 1.5TB disk: (western digital caviar green)
  Vendor: ATA       Model: WDC WD15EADS-00R  Rev: 01.0
  Type:   Direct-Access                      ANSI SCSI revision: 05

$ cat /proc/cpuinfo
processor       : 0
vendor_id       : AuthenticAMD
cpu family      : 15
model           : 127
model name      : AMD Sempron(tm) Processor LE-1200
stepping        : 1
cpu MHz         : 1800.000
cache size      : 512 KB
[...]

mainboard is an ASUS M2N with 4 onbord SATA ports.

anyway, when all the 1.5TB disks are active in the raid5, I'll simulate a
power failure and see how long the re-sync will take then.



^ permalink raw reply	[flat|nested] 19+ messages in thread

* Re: resync duration ?
  2009-05-15  5:07       ` Leslie Rhorer
@ 2009-05-15 20:21         ` Richard Scobie
  2009-05-15 21:51         ` Neil Brown
  1 sibling, 0 replies; 19+ messages in thread
From: Richard Scobie @ 2009-05-15 20:21 UTC (permalink / raw)
  To: lrhorer; +Cc: 'Linux RAID'

Leslie Rhorer wrote:

> Thanks.  Those are pretty easy, but it would be nice if they were in the MAN
> page.  Better yet, have you considered adding both as command line options
> for mdadm?

This (and other md info0 is contained in the kernel source 
Documentation/md.txt.

Regards,

Richard

^ permalink raw reply	[flat|nested] 19+ messages in thread

* RE: resync duration ?
  2009-05-15  5:07       ` Leslie Rhorer
  2009-05-15 20:21         ` Richard Scobie
@ 2009-05-15 21:51         ` Neil Brown
  2009-05-16  2:47           ` Richard Scobie
  2009-05-18 15:00           ` Bill Davidsen
  1 sibling, 2 replies; 19+ messages in thread
From: Neil Brown @ 2009-05-15 21:51 UTC (permalink / raw)
  To: lrhorer; +Cc: 'Linux RAID'

On Friday May 15, lrhorer@satx.rr.com wrote:
> > 
> >   echo check > /sys/block/mdX/md/sync_action.
> > 
> > or "echo repair" to get it to fix anything it finds.
> 
> Thanks.  Those are pretty easy, but it would be nice if they were in the MAN
> page.  Better yet, have you considered adding both as command line options
> for mdadm?

Considered, yes.  But they are very low on my priority list.
I accept patches.
Maybe and option like
   mdadm --syncaction=check /dev/md0

??

NeilBrown

^ permalink raw reply	[flat|nested] 19+ messages in thread

* Re: resync duration ?
  2009-05-15 21:51         ` Neil Brown
@ 2009-05-16  2:47           ` Richard Scobie
  2009-05-16  8:10             ` NeilBrown
  2009-05-18 15:00           ` Bill Davidsen
  1 sibling, 1 reply; 19+ messages in thread
From: Richard Scobie @ 2009-05-16  2:47 UTC (permalink / raw)
  To: Neil Brown; +Cc: lrhorer, 'Linux RAID'

Neil Brown wrote:

> Considered, yes.  But they are very low on my priority list.
> I accept patches.
> Maybe and option like
>    mdadm --syncaction=check /dev/md0

Perhaps it could also be incorporated in a manner similar to the way 
smartd allows for scheduled tests - a parameter added to each array in 
the mdadm.conf would allow scheduled checks when mdadm is run as a  daemon.

Regards,

Richard

^ permalink raw reply	[flat|nested] 19+ messages in thread

* Re: resync duration ?
  2009-05-16  2:47           ` Richard Scobie
@ 2009-05-16  8:10             ` NeilBrown
  0 siblings, 0 replies; 19+ messages in thread
From: NeilBrown @ 2009-05-16  8:10 UTC (permalink / raw)
  To: Richard Scobie; +Cc: lrhorer, 'Linux RAID'

On Sat, May 16, 2009 12:47 pm, Richard Scobie wrote:
> Neil Brown wrote:
>
>> Considered, yes.  But they are very low on my priority list.
>> I accept patches.
>> Maybe and option like
>>    mdadm --syncaction=check /dev/md0
>
> Perhaps it could also be incorporated in a manner similar to the way
> smartd allows for scheduled tests - a parameter added to each array in
> the mdadm.conf would allow scheduled checks when mdadm is run as a
> daemon.

Thanks for the suggestion...

However I have the strange and old-fashion notion that the strength
of Unix (which is one of the foundations of Linux) is the "tools
approach".  You have a tool for each job, and it does the job properly.

In the case of scheduled checks. cron is the tool of choice.

So if you want a monthly, or yearly, or daily check, get cron
to schedule it.

Just like if you want to be regularly reminded about degraded arrays,
get cron to run "mdadm -monitor --oneshot --scan" every morning.

I am definitely not interested in adding any scheduling capabilities
to mdadm.
(is smartd does scheduled checks... more fool them :-)

NeilBrown


^ permalink raw reply	[flat|nested] 19+ messages in thread

* Re: resync duration ?
  2009-05-15 21:51         ` Neil Brown
  2009-05-16  2:47           ` Richard Scobie
@ 2009-05-18 15:00           ` Bill Davidsen
  2009-05-28 16:43             ` John Robinson
  1 sibling, 1 reply; 19+ messages in thread
From: Bill Davidsen @ 2009-05-18 15:00 UTC (permalink / raw)
  To: Neil Brown; +Cc: lrhorer, 'Linux RAID'

Neil Brown wrote:
> On Friday May 15, lrhorer@satx.rr.com wrote:
>   
>>>   echo check > /sys/block/mdX/md/sync_action.
>>>
>>> or "echo repair" to get it to fix anything it finds.
>>>       
>> Thanks.  Those are pretty easy, but it would be nice if they were in the MAN
>> page.  Better yet, have you considered adding both as command line options
>> for mdadm?
>>     
>
> Considered, yes.  But they are very low on my priority list.
> I accept patches.
> Maybe and option like
>    mdadm --syncaction=check /dev/md0
>
> ??
>   

That is a great idea, it puts the md administration in the mdadm program 
(where it belongs) instead of counting on a script to do the right 
thing. Well, if you do this at all, the other actions are probably 
appropriate as well.

-- 
bill davidsen <davidsen@tmr.com>
  CTO TMR Associates, Inc

"You are disgraced professional losers. And by the way, give us our money back."
    - Representative Earl Pomeroy,  Democrat of North Dakota
on the A.I.G. executives who were paid bonuses  after a federal bailout.



^ permalink raw reply	[flat|nested] 19+ messages in thread

* Re: resync duration ?
  2009-05-14 13:06 resync duration ? Rainer Fuegenstein
  2009-05-14 13:58 ` Bryan Mesich
  2009-05-15  1:22 ` Neil Brown
@ 2009-05-18 15:02 ` Bill Davidsen
  2 siblings, 0 replies; 19+ messages in thread
From: Bill Davidsen @ 2009-05-18 15:02 UTC (permalink / raw)
  To: Rainer Fuegenstein; +Cc: linux-raid

Rainer Fuegenstein wrote:
> Hi guys,
>
> I'm a but confused about the duration of a raid5 resync:
>
> - occasionally, my server with 4*750 gb sata raid5 crashed (because of
> problems with the power supply); after rebooting it took about 10 to 12
> hours to resync the raid5 (guess it just re-created some parity
> information or however it works internally, but didn't have to copy
> any data)
>
>   
I have a test machine which crashes regularly (used for kernel and 
hardware tests), and that sounds long. Are you running without a bitmap, 
by any chance?

> - right now I replaced one 750GB disk with a 1.5TB disk, but now
> resyncing (according to /proc/mdstat) it is supposed to take only 160
> minutes ?! although it needs top copy data to a blank disk ?
>
> is this normal or should I be worried, especially before I pull out
> the next 750GB disk and replace it with the next 1.5TB disk ?
>
> tnx in advance.
>
> --
> To unsubscribe from this list: send the line "unsubscribe linux-raid" in
> the body of a message to majordomo@vger.kernel.org
> More majordomo info at  http://vger.kernel.org/majordomo-info.html
>
>   


-- 
bill davidsen <davidsen@tmr.com>
  CTO TMR Associates, Inc

"You are disgraced professional losers. And by the way, give us our money back."
    - Representative Earl Pomeroy,  Democrat of North Dakota
on the A.I.G. executives who were paid bonuses  after a federal bailout.



^ permalink raw reply	[flat|nested] 19+ messages in thread

* Re: resync duration ?
  2009-05-18 15:00           ` Bill Davidsen
@ 2009-05-28 16:43             ` John Robinson
  0 siblings, 0 replies; 19+ messages in thread
From: John Robinson @ 2009-05-28 16:43 UTC (permalink / raw)
  To: Linux RAID

On 18/05/2009 16:00, Bill Davidsen wrote:
> Neil Brown wrote:
[...]
>> Maybe and option like
>>    mdadm --syncaction=check /dev/md0
>>
>> ??
> 
> That is a great idea, it puts the md administration in the mdadm program 
> (where it belongs) instead of counting on a script to do the right 
> thing. Well, if you do this at all, the other actions are probably 
> appropriate as well.

It would be nice if such a thing blocked until completion and reported 
the results of the sync action, instead of or as well as them going to 
the system log, so they can be emailed by cron or otherwise examined by 
the caller.

I know, I know, you accept patches :-) but my coding skills are way too 
rusty to attempt this.

Cheers,

John.


^ permalink raw reply	[flat|nested] 19+ messages in thread

end of thread, other threads:[~2009-05-28 16:43 UTC | newest]

Thread overview: 19+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2009-05-14 13:06 resync duration ? Rainer Fuegenstein
2009-05-14 13:58 ` Bryan Mesich
2009-05-14 16:26   ` Ryan Wagoner
2009-05-14 17:03     ` Re[2]: " Rainer Fuegenstein
2009-05-14 17:06   ` John Robinson
2009-05-14 18:07     ` Bryan Mesich
2009-05-14 19:38       ` Richard Scobie
2009-05-15  1:22 ` Neil Brown
2009-05-15  4:04   ` Leslie Rhorer
2009-05-15  4:44     ` NeilBrown
2009-05-15  5:07       ` Leslie Rhorer
2009-05-15 20:21         ` Richard Scobie
2009-05-15 21:51         ` Neil Brown
2009-05-16  2:47           ` Richard Scobie
2009-05-16  8:10             ` NeilBrown
2009-05-18 15:00           ` Bill Davidsen
2009-05-28 16:43             ` John Robinson
2009-05-15 11:16   ` Rainer Fuegenstein
2009-05-18 15:02 ` Bill Davidsen

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).