[Doubt] how is a disk marked faulty in RAID5

linux-raid.vger.kernel.org archive mirror
 help / color / mirror / Atom feed

* [Doubt] how is a disk marked faulty in RAID5
@ 2012-04-24 23:36 Anuj Goel
  2012-04-27 20:32 ` Anuj Goel
  2012-04-27 22:20 ` NeilBrown
  0 siblings, 2 replies; 3+ messages in thread
From: Anuj Goel @ 2012-04-24 23:36 UTC (permalink / raw)
  To: Linux RAID

Hi,

I have been looking into the RAID5 code, but unable to find how a disk
is marked faulty in a RAID5 array.
I consider the case when we try to read, say 2 sectors within a chunk
and the read fails. My understanding so far is as below:

The status of the read operation is returned in the call back function
"raid5_align_endio" registered in chunk-aligned_read().
If there was an error in the read, it LIFO adds the original bio to
the retry list and wakes up the raid5d thread.
This thread will remove the bio from the retry list and send it to
retry_aligned_read().

In retry_aligned_read(), we first compute the disk number and sector
offset within the disk using raid5_compute_sector().

1. Then we do some stripe operations, but I cannot see where the
actual read from the disk is scheduled.
2. Also, if the sector on the disk is found unreadable, according to
the RAID5 design, it should be recomputed using parity and the disk
marked FAULTY. Can you please point me to the code/functions I should
look into to understand how this is being done.
3. After one disk failure, if another disk fails, I think the RAID5
array cannot be used anymore. How is the second disk failure reported
?

This is my first tryst with Linux code, (specifically software RAID),
so I am not sure how to debug and understand the code flow.
Is code reading the only way to understand the flow, or is there some
documentation giving a high level overview of the implementation of
software RAID ?

Any suggestions will be highly appreciated !!

-- 
Best Regards,
Anuj Goel
Experimental Computer Science Lab
Stony Brook University.
Cell: +1-801-209-5873

^ permalink raw reply	[flat|nested] 3+ messages in thread

* Re: [Doubt] how is a disk marked faulty in RAID5
  2012-04-24 23:36 [Doubt] how is a disk marked faulty in RAID5 Anuj Goel
@ 2012-04-27 20:32 ` Anuj Goel
  2012-04-27 22:20 ` NeilBrown
  1 sibling, 0 replies; 3+ messages in thread
From: Anuj Goel @ 2012-04-27 20:32 UTC (permalink / raw)
  To: Linux RAID

Hi folks,
Can someone please help me with below doubts.
Thanks in advance !!

On Tue, Apr 24, 2012 at 7:36 PM, Anuj Goel <agoel@cs.stonybrook.edu> wrote:
> Hi,
>
> I have been looking into the RAID5 code, but unable to find how a disk
> is marked faulty in a RAID5 array.
> I consider the case when we try to read, say 2 sectors within a chunk
> and the read fails. My understanding so far is as below:
>
> The status of the read operation is returned in the call back function
> "raid5_align_endio" registered in chunk-aligned_read().
> If there was an error in the read, it LIFO adds the original bio to
> the retry list and wakes up the raid5d thread.
> This thread will remove the bio from the retry list and send it to
> retry_aligned_read().
>
> In retry_aligned_read(), we first compute the disk number and sector
> offset within the disk using raid5_compute_sector().
>
> 1. Then we do some stripe operations, but I cannot see where the
> actual read from the disk is scheduled.
> 2. Also, if the sector on the disk is found unreadable, according to
> the RAID5 design, it should be recomputed using parity and the disk
> marked FAULTY. Can you please point me to the code/functions I should
> look into to understand how this is being done.
> 3. After one disk failure, if another disk fails, I think the RAID5
> array cannot be used anymore. How is the second disk failure reported
> ?
>
> This is my first tryst with Linux code, (specifically software RAID),
> so I am not sure how to debug and understand the code flow.
> Is code reading the only way to understand the flow, or is there some
> documentation giving a high level overview of the implementation of
> software RAID ?
>
> Any suggestions will be highly appreciated !!
>
> --
> Best Regards,
> Anuj Goel
> Experimental Computer Science Lab
> Stony Brook University.



-- 
Best Regards,
Anuj Goel
Experimental Computer Science Lab
Stony Brook University.

^ permalink raw reply	[flat|nested] 3+ messages in thread

* Re: [Doubt] how is a disk marked faulty in RAID5
  2012-04-24 23:36 [Doubt] how is a disk marked faulty in RAID5 Anuj Goel
  2012-04-27 20:32 ` Anuj Goel
@ 2012-04-27 22:20 ` NeilBrown
  1 sibling, 0 replies; 3+ messages in thread
From: NeilBrown @ 2012-04-27 22:20 UTC (permalink / raw)
  To: Anuj Goel; +Cc: Linux RAID

[-- Attachment #1: Type: text/plain, Size: 2558 bytes --]

On Tue, 24 Apr 2012 19:36:48 -0400 Anuj Goel <agoel@cs.stonybrook.edu> wrote:

> Hi,
> 
> I have been looking into the RAID5 code, but unable to find how a disk
> is marked faulty in a RAID5 array.

A call to md_error() calls back into error() and marks it fault.

> I consider the case when we try to read, say 2 sectors within a chunk
> and the read fails. My understanding so far is as below:
> 
> The status of the read operation is returned in the call back function
> "raid5_align_endio" registered in chunk-aligned_read().
> If there was an error in the read, it LIFO adds the original bio to
> the retry list and wakes up the raid5d thread.
> This thread will remove the bio from the retry list and send it to
> retry_aligned_read().
> 
> In retry_aligned_read(), we first compute the disk number and sector
> offset within the disk using raid5_compute_sector().
> 
> 1. Then we do some stripe operations, but I cannot see where the
> actual read from the disk is scheduled.

retry_alignd_read calls handle_stripe()
handle_stripe() calls handle_stripe_fill()
handle_stripe_fill() calls fetch_block which sets R5_Wantread on the target
'dev', or on every other dev (later it will set R5_Wantcompute on the target
dev).
handle_stripe() calls ops_run_io which notices R5_Wantread an schedules the
read.

> 2. Also, if the sector on the disk is found unreadable, according to
> the RAID5 design, it should be recomputed using parity and the disk
> marked FAULTY. Can you please point me to the code/functions I should
> look into to understand how this is being done.

fetch_block is probably the important bit to read and understand
Note how it sets STRIPE_OP_COMPUTE_BLOCK and subsequently __raid_run_ops()
notices that and runs ops_run_compute5() which sets up the xor process.


> 3. After one disk failure, if another disk fails, I think the RAID5
> array cannot be used anymore. How is the second disk failure reported
> ?

What exactly to you mean by "reported".
The second disk is marked as faulty
Subsequent writes and some reads will fail.
See the
	if (s.failed > conf->max_degraded) {
branch in handle_stripe()


> 
> This is my first tryst with Linux code, (specifically software RAID),
> so I am not sure how to debug and understand the code flow.
> Is code reading the only way to understand the flow, or is there some
> documentation giving a high level overview of the implementation of
> software RAID ?
> 
> Any suggestions will be highly appreciated !!
> 

NeilBrown


[-- Attachment #2: signature.asc --]
[-- Type: application/pgp-signature, Size: 828 bytes --]

^ permalink raw reply	[flat|nested] 3+ messages in thread

end of thread, other threads:[~2012-04-27 22:20 UTC | newest]

Thread overview: 3+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2012-04-24 23:36 [Doubt] how is a disk marked faulty in RAID5 Anuj Goel
2012-04-27 20:32 ` Anuj Goel
2012-04-27 22:20 ` NeilBrown

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).