md raid behavior, bad sector uncorrectable read error

linux-raid.vger.kernel.org archive mirror
 help / color / mirror / Atom feed

* md raid behavior, bad sector uncorrectable read error
@ 2012-08-18 22:40 Chris Murphy
  2012-08-19  8:54 ` Albert Pauw
  2012-08-19  9:47 ` Mikael Abrahamsson
  0 siblings, 2 replies; 10+ messages in thread
From: Chris Murphy @ 2012-08-18 22:40 UTC (permalink / raw)
  To: linux-raid

I'm not experiencing this at the moment, but I'm curious about what happens if it were to happen.

A drive detects a sector error, but can't correct it, and returns an 0x40 error to the system.

Presumably md raid (any RAID I would think) becomes aware of this error, as the chunk is incomplete without the sector. Next, does md raid rebuild the affected chunk from parity on-the-fly? Or is the entire drive dropped from the array and placed into degraded mode?

If the affected chunk is rebuilt from parity (or copy in case of mirror), does md raid issue a write command to write the rebuilt chunk back to the disk at the same LBAs? If so, shouldn't this force the drive to determine if the sector error is transient or persistent, and if persistent the disk firmware will remap the bad sector to a reserve sector?

Thanks,

Chris Murphy

^ permalink raw reply	[flat|nested] 10+ messages in thread

* Re: md raid behavior, bad sector uncorrectable read error
  2012-08-18 22:40 md raid behavior, bad sector uncorrectable read error Chris Murphy
@ 2012-08-19  8:54 ` Albert Pauw
  2012-08-19 18:12   ` Chris Murphy
  2012-08-19  9:47 ` Mikael Abrahamsson
  1 sibling, 1 reply; 10+ messages in thread
From: Albert Pauw @ 2012-08-19  8:54 UTC (permalink / raw)
  To: Chris Murphy; +Cc: linux-raid

From what I know (and please correct me if I'm wrong), the drive
happily remaps the sector to a different, spare, location. Only when
it can't do that it throws an error status back. Mind you, more than
10 years ago I had a scsi disk in a linux machine which suddenly broke
down. Checking the logs I saw a lot of remap messages in the log in
the weeks before. So it maybe that the drive gives warnings back up
the controller chain.
If this is true, what is md doing with this? If the remaps increase on
that particular drive does i throw it out of the raid set?

Curious on what actually happens, interesting question.

Just my two cents,

Albert

On 19 August 2012 00:40, Chris Murphy <lists@colorremedies.com> wrote:
> I'm not experiencing this at the moment, but I'm curious about what happens if it were to happen.
>
> A drive detects a sector error, but can't correct it, and returns an 0x40 error to the system.
>
> Presumably md raid (any RAID I would think) becomes aware of this error, as the chunk is incomplete without the sector. Next, does md raid rebuild the affected chunk from parity on-the-fly? Or is the entire drive dropped from the array and placed into degraded mode?
>
> If the affected chunk is rebuilt from parity (or copy in case of mirror), does md raid issue a write command to write the rebuilt chunk back to the disk at the same LBAs? If so, shouldn't this force the drive to determine if the sector error is transient or persistent, and if persistent the disk firmware will remap the bad sector to a reserve sector?
>
> Thanks,
>
> Chris Murphy
>
>
> --
> To unsubscribe from this list: send the line "unsubscribe linux-raid" in
> the body of a message to majordomo@vger.kernel.org
> More majordomo info at  http://vger.kernel.org/majordomo-info.html

^ permalink raw reply	[flat|nested] 10+ messages in thread

* Re: md raid behavior, bad sector uncorrectable read error
  2012-08-19  8:54 ` Albert Pauw
@ 2012-08-19 18:12   ` Chris Murphy
  2012-08-19 22:19     ` NeilBrown
  2012-08-20  6:36     ` David Brown
  0 siblings, 2 replies; 10+ messages in thread
From: Chris Murphy @ 2012-08-19 18:12 UTC (permalink / raw)
  To: Linux RAID

On Aug 19, 2012, at 2:54 AM, Albert Pauw wrote:

> From what I know (and please correct me if I'm wrong), the drive
> happily remaps the sector to a different, spare, location.

Yes. Although there might be a difference between SATA and SAS handling, as SAS ECC is better by a lot. SATA ECC can return false corrections. Small problem.

For this thread, the context I'm curious about, is the detected but uncorrectable sector error. 

My understanding is that since the error is not corrected, the firmware won't remap the sector, but sets it as a pending sector remap until it receives a write command for that LBA. If the write is successful, pending status is removed. If the write persistently fails, the data is written to a reserved (good) sector which then gets that LBA and the bad sector is reserved (bad), no LBA. Since the LBA is the same, neither md nor the file system need to be informed of anything.

> Only when
> it can't do that it throws an error status back. Mind you, more than
> 10 years ago I had a scsi disk in a linux machine which suddenly broke
> down. Checking the logs I saw a lot of remap messages in the log in
> the weeks before. So it maybe that the drive gives warnings back up
> the controller chain.
> If this is true, what is md doing with this? If the remaps increase on
> that particular drive does i throw it out of the raid set?

What I'd like to see happen, is read errors cause chunk reconstruction from parity (or mirrored copy), and rewritten to disk. If it's just on-the-fly correction, without re-writing the chunk, then we'd need to run 'echo repair > /sys/block/mdX/md/sync_action' which would take an awfully long time in comparison.

On Aug 19, 2012, at 3:47 AM, Mikael Abrahamsson wrote:
> If I remember correctly from what has been described here before, a read error will cause a re-write with information created from parity

That's the thing I'd like to get a definitive answer one. If only it were easy to simulate bad sectors in VM's I could just test it!

> If it throws a write error, the drive is kicked from the array (because a drive with write errors is clearly defective).

That makes complete sense.

Chris Murphy

^ permalink raw reply	[flat|nested] 10+ messages in thread

* Re: md raid behavior, bad sector uncorrectable read error
  2012-08-19 18:12   ` Chris Murphy
@ 2012-08-19 22:19     ` NeilBrown
  2012-08-20  2:06       ` Chris Murphy
  2012-08-20  6:36     ` David Brown
  1 sibling, 1 reply; 10+ messages in thread
From: NeilBrown @ 2012-08-19 22:19 UTC (permalink / raw)
  To: Chris Murphy; +Cc: Linux RAID

[-- Attachment #1: Type: text/plain, Size: 594 bytes --]

On Sun, 19 Aug 2012 12:12:38 -0600 Chris Murphy <lists@colorremedies.com>
wrote:

> On Aug 19, 2012, at 3:47 AM, Mikael Abrahamsson wrote:
> > If I remember correctly from what has been described here before, a read error will cause a re-write with information created from parity
> 
> That's the thing I'd like to get a definitive answer one. If only it were easy to simulate bad sectors in VM's I could just test it!

If only it were easy to read the documentation, and if only the documentation
were sufficiently complete.....

% man 4 md

search for 'read error'

NeilBrown

[-- Attachment #2: signature.asc --]
[-- Type: application/pgp-signature, Size: 828 bytes --]

^ permalink raw reply	[flat|nested] 10+ messages in thread

* Re: md raid behavior, bad sector uncorrectable read error
  2012-08-19 22:19     ` NeilBrown
@ 2012-08-20  2:06       ` Chris Murphy
  2012-08-20 12:36         ` Phil Turmel
  0 siblings, 1 reply; 10+ messages in thread
From: Chris Murphy @ 2012-08-20  2:06 UTC (permalink / raw)
  To: Linux RAID

On Aug 19, 2012, at 4:19 PM, NeilBrown wrote:

> On Sun, 19 Aug 2012 12:12:38 -0600 Chris Murphy <lists@colorremedies.com>
> wrote:
> 
>> On Aug 19, 2012, at 3:47 AM, Mikael Abrahamsson wrote:
>>> If I remember correctly from what has been described here before, a read error will cause a re-write with information created from parity
>> 
>> That's the thing I'd like to get a definitive answer one. If only it were easy to simulate bad sectors in VM's I could just test it!
> 
> If only it were easy to read the documentation, and if only the documentation
> were sufficiently complete.....
> 
> % man 4 md
> 
> search for 'read error'

The well deserved and diplomatic variant of RTFM. So the summary is: a write error on RAID 1456 causes md to mark the device as faulty. A read error (with recent kernels) causes correct data to overwrite the bad block *and* re-read, and if either write or re-read fails, the device is marked faulty. Very cool.

BTW, there's a lot of detail nuggets in man md that I'm not finding documented anywhere else. Nor have I found in other documentation a reference to man md until now.

Chris Murphy

^ permalink raw reply	[flat|nested] 10+ messages in thread

* Re: md raid behavior, bad sector uncorrectable read error
  2012-08-20  2:06       ` Chris Murphy
@ 2012-08-20 12:36         ` Phil Turmel
  0 siblings, 0 replies; 10+ messages in thread
From: Phil Turmel @ 2012-08-20 12:36 UTC (permalink / raw)
  To: Chris Murphy; +Cc: Linux RAID

On 08/19/2012 10:06 PM, Chris Murphy wrote:

> BTW, there's a lot of detail nuggets in man md that I'm not finding
> documented anywhere else. Nor have I found in other documentation a
> reference to man md until now.

Seriously?  Every other man page in my mdadm installation lists "md(4)"
in the "See Also" section.

Or were you simply not aware that man pages are the *primary*
form of documentation for most core software in *nix environments?
You should consider reviewing "man man".

Phil

^ permalink raw reply	[flat|nested] 10+ messages in thread

* Re: md raid behavior, bad sector uncorrectable read error
  2012-08-19 18:12   ` Chris Murphy
  2012-08-19 22:19     ` NeilBrown
@ 2012-08-20  6:36     ` David Brown
  1 sibling, 0 replies; 10+ messages in thread
From: David Brown @ 2012-08-20  6:36 UTC (permalink / raw)
  To: Chris Murphy; +Cc: Linux RAID

On 19/08/2012 20:12, Chris Murphy wrote:

>
> On Aug 19, 2012, at 3:47 AM, Mikael Abrahamsson wrote:
>> If I remember correctly from what has been described here before, a
>> read error will cause a re-write with information created from
>> parity
>
> That's the thing I'd like to get a definitive answer one. If only it
> were easy to simulate bad sectors in VM's I could just test it!
>

I wonder why no one has thought of this idea before...  Wait a minute, 
they have!  When you read the "man 4 md" or "man mdadm" pages, read 
about the "faulty" module.  You don't have to use a virtual machine for 
this, it has been used for testing md raid from long before virtual 
machines were practical in the PC world.

(Sorry for the sarcasm - it was just too tempting in light of Neil's 
recommendation to read the "man 4 md" page!)

Have fun playing with these things - it's good practice so you know what 
to expect if a real failure occurs.

>> If it throws a write error, the drive is kicked from the array
>> (because a drive with write errors is clearly defective).
>
> That makes complete sense.
>

^ permalink raw reply	[flat|nested] 10+ messages in thread

* Re: md raid behavior, bad sector uncorrectable read error
  2012-08-18 22:40 md raid behavior, bad sector uncorrectable read error Chris Murphy
  2012-08-19  8:54 ` Albert Pauw
@ 2012-08-19  9:47 ` Mikael Abrahamsson
  2012-08-19 10:39   ` Oliver Schinagl
  1 sibling, 1 reply; 10+ messages in thread
From: Mikael Abrahamsson @ 2012-08-19  9:47 UTC (permalink / raw)
  To: Chris Murphy; +Cc: linux-raid

On Sat, 18 Aug 2012, Chris Murphy wrote:

> If the affected chunk is rebuilt from parity (or copy in case of 
> mirror), does md raid issue a write command to write the rebuilt chunk 
> back to the disk at the same LBAs? If so, shouldn't this force the drive 
> to determine if the sector error is transient or persistent, and if 
> persistent the disk firmware will remap the bad sector to a reserve 
> sector?

If I remember correctly from what has been described here before, a read 
error will cause a re-write with information created from parity, and the 
drive should figure it out (either is successfully writes the new 
information to the existing sector, or if it can't, it writes it to a 
spare sector (re-map)). If it throws a write error, the drive is kicked 
from the array (because a drive with write errors is clearly defective).

So yes, you're correct in your assumption from what I know.

-- 
Mikael Abrahamsson    email: swmike@swm.pp.se

^ permalink raw reply	[flat|nested] 10+ messages in thread

* Re: md raid behavior, bad sector uncorrectable read error
  2012-08-19  9:47 ` Mikael Abrahamsson
@ 2012-08-19 10:39   ` Oliver Schinagl
  0 siblings, 0 replies; 10+ messages in thread
From: Oliver Schinagl @ 2012-08-19 10:39 UTC (permalink / raw)
  To: Mikael Abrahamsson; +Cc: Chris Murphy, linux-raid

On 08/19/12 11:47, Mikael Abrahamsson wrote:
> On Sat, 18 Aug 2012, Chris Murphy wrote:
>
>> If the affected chunk is rebuilt from parity (or copy in case of
>> mirror), does md raid issue a write command to write the rebuilt chunk
>> back to the disk at the same LBAs? If so, shouldn't this force the
>> drive to determine if the sector error is transient or persistent, and
>> if persistent the disk firmware will remap the bad sector to a reserve
>> sector?
>
> If I remember correctly from what has been described here before, a read
> error will cause a re-write with information created from parity, and
> the drive should figure it out (either is successfully writes the new
> information to the existing sector, or if it can't, it writes it to a
> spare sector (re-map)). If it throws a write error, the drive is kicked
> from the array (because a drive with write errors is clearly defective).
>
> So yes, you're correct in your assumption from what I know.
>
Also, SMARTd would throw up errors when monitoring such drive when a 
re-map occurs, mdadm may not even notice this if the remap happens 
successfully.

^ permalink raw reply	[flat|nested] 10+ messages in thread

* Re: md raid behavior, bad sector uncorrectable read error
@ 2012-08-20  4:53 Richard Scobie
  0 siblings, 0 replies; 10+ messages in thread
From: Richard Scobie @ 2012-08-20  4:53 UTC (permalink / raw)
  To: Linux RAID Mailing List

Chris Murphy wrote:

 > Nor have I found in other documentation a reference to man md until
 > now.

The command

man -k KEYWORD

will search the whatis database for man pages containing KEYWORD, 
(assuming you have run "makewhatis", to generate a current database).

man -k md

or

man -k RAID

both return the md(4) man page.

Regards,

Richard

^ permalink raw reply	[flat|nested] 10+ messages in thread

end of thread, other threads:[~2012-08-20 12:36 UTC | newest]

Thread overview: 10+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2012-08-18 22:40 md raid behavior, bad sector uncorrectable read error Chris Murphy
2012-08-19  8:54 ` Albert Pauw
2012-08-19 18:12   ` Chris Murphy
2012-08-19 22:19     ` NeilBrown
2012-08-20  2:06       ` Chris Murphy
2012-08-20 12:36         ` Phil Turmel
2012-08-20  6:36     ` David Brown
2012-08-19  9:47 ` Mikael Abrahamsson
2012-08-19 10:39   ` Oliver Schinagl
  -- strict thread matches above, loose matches on Subject: below --
2012-08-20  4:53 Richard Scobie

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).