* RAID5/6 slow due to a member device -- how to diagnose?
@ 2016-03-03 14:30 Roman Mamedov
2016-03-03 14:35 ` Mateusz Korniak
2016-03-03 14:37 ` Carlos Carvalho
0 siblings, 2 replies; 4+ messages in thread
From: Roman Mamedov @ 2016-03-03 14:30 UTC (permalink / raw)
To: linux-raid
[-- Attachment #1: Type: text/plain, Size: 773 bytes --]
Hello,
Let's suppose I am seeing slow write performance on a RAID5 or RAID6, and I am
suspecting that it's because of one of its devices operating slower than
others (maybe a soft-failing or just low-performing hard drive, in case with
non-identical member drives). Nothing in dmesg, no timeouts or ATA errors or
anything like that.
How would I go about diagnosing which device that is (if any) is slowing down
the array, aside from performance-testing each individual device (which seems
rather difficult if it's only slow on writes, not reads).
In 'top' there's of course high iowait, 50-70% all the time during writes. But
is that broken down somewhere to show which particular block device is being
the most 'waited for'?
--
With respect,
Roman
[-- Attachment #2: OpenPGP digital signature --]
[-- Type: application/pgp-signature, Size: 181 bytes --]
^ permalink raw reply [flat|nested] 4+ messages in thread
* Re: RAID5/6 slow due to a member device -- how to diagnose?
2016-03-03 14:30 RAID5/6 slow due to a member device -- how to diagnose? Roman Mamedov
@ 2016-03-03 14:35 ` Mateusz Korniak
2016-03-03 15:00 ` Roman Mamedov
2016-03-03 14:37 ` Carlos Carvalho
1 sibling, 1 reply; 4+ messages in thread
From: Mateusz Korniak @ 2016-03-03 14:35 UTC (permalink / raw)
To: Roman Mamedov; +Cc: linux-raid
On Thursday 03 March 2016 19:30:26 Roman Mamedov wrote:
> But is that broken down somewhere to show which particular block device is
> being the most 'waited for'?
iostat ?
r_await w_await svctm %util columns?
Regards,
--
Mateusz Korniak
"(...) mam brata - poważny, domator, liczykrupa, hipokryta, pobożniś,
krótko mówiąc - podpora społeczeństwa."
Nikos Kazantzakis - "Grek Zorba"
--
To unsubscribe from this list: send the line "unsubscribe linux-raid" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
^ permalink raw reply [flat|nested] 4+ messages in thread
* Re: RAID5/6 slow due to a member device -- how to diagnose?
2016-03-03 14:30 RAID5/6 slow due to a member device -- how to diagnose? Roman Mamedov
2016-03-03 14:35 ` Mateusz Korniak
@ 2016-03-03 14:37 ` Carlos Carvalho
1 sibling, 0 replies; 4+ messages in thread
From: Carlos Carvalho @ 2016-03-03 14:37 UTC (permalink / raw)
To: linux-raid
Roman Mamedov (rm@romanrm.net) wrote on Thu, Mar 03, 2016 at 11:30:26AM BRT:
> Let's suppose I am seeing slow write performance on a RAID5 or RAID6, and I am
> suspecting that it's because of one of its devices operating slower than
> others (maybe a soft-failing or just low-performing hard drive, in case with
> non-identical member drives). Nothing in dmesg, no timeouts or ATA errors or
> anything like that.
>
> How would I go about diagnosing which device that is (if any) is slowing down
> the array, aside from performance-testing each individual device (which seems
> rather difficult if it's only slow on writes, not reads).
I use sar -d -p 5 |grep -v md
^ permalink raw reply [flat|nested] 4+ messages in thread
* Re: RAID5/6 slow due to a member device -- how to diagnose?
2016-03-03 14:35 ` Mateusz Korniak
@ 2016-03-03 15:00 ` Roman Mamedov
0 siblings, 0 replies; 4+ messages in thread
From: Roman Mamedov @ 2016-03-03 15:00 UTC (permalink / raw)
To: Mateusz Korniak; +Cc: linux-raid
[-- Attachment #1: Type: text/plain, Size: 875 bytes --]
On Thu, 03 Mar 2016 15:35:21 +0100
Mateusz Korniak <mateusz-lists@ant.gliwice.pl> wrote:
> On Thursday 03 March 2016 19:30:26 Roman Mamedov wrote:
> > But is that broken down somewhere to show which particular block device is
> > being the most 'waited for'?
>
> iostat ?
>
> r_await w_await svctm %util columns?
Cheers -- never knew there was this 'extended' output mode in iostat.
Indeed it pinpoints the culprit nicely.
Device: [...] wkB/s avgrq-sz avgqu-sz await r_await w_await svctm %util
sde1 [...] 53748.00 969.01 1.95 17.46 68.00 17.00 4.09 45.40
sdf1 [...] 49410.75 1176.97 32.09 393.95 106.00 397.42 11.90 100.00
sdg1 [...] 50230.00 995.17 7.12 66.81 88.00 66.60 6.08 61.40
sdh1 [...] 53110.00 970.89 2.46 22.21 46.40 21.64 4.64 50.80
--
With respect,
Roman
[-- Attachment #2: OpenPGP digital signature --]
[-- Type: application/pgp-signature, Size: 181 bytes --]
^ permalink raw reply [flat|nested] 4+ messages in thread
end of thread, other threads:[~2016-03-03 15:00 UTC | newest]
Thread overview: 4+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2016-03-03 14:30 RAID5/6 slow due to a member device -- how to diagnose? Roman Mamedov
2016-03-03 14:35 ` Mateusz Korniak
2016-03-03 15:00 ` Roman Mamedov
2016-03-03 14:37 ` Carlos Carvalho
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox