linux-raid.vger.kernel.org archive mirror
* raid1 IO hang - md_submit_flush_data ignores md_write_start return value
@ 2017-09-20 12:49 Nate Dailey
  2017-09-20 19:35 ` Shaohua Li
  0 siblings, 1 reply; 3+ messages in thread
From: Nate Dailey @ 2017-09-20 12:49 UTC (permalink / raw)
  To: linux-raid

I'm seeing an occasional XFS IO hang with raid1 (INFO: task xfsaild/md20:17963 
blocked for more than 120 seconds).

It turns out that this is because md_submit_flush_data calls pers->make_request 
but doesn't check the return value (unlike md_make_request, which checks the 
return value and retries). So if raid1_make_request/md_write_start returns false, 
md_submit_flush_data silently drops the write and it never completes.
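
To make the difference between the two paths concrete, here is a small 
user-space model of it. This is only a sketch, not the kernel code: 
struct model_dev and the model_* functions are made-up names, and the 
"refuse the first N writes" behaviour is an assumption standing in for 
md_write_start returning false.

```c
#include <stdbool.h>

/* Hypothetical stand-in for an md personality. It refuses the first
 * `refusals_left` writes, modelling md_write_start returning false. */
struct model_dev {
    int refusals_left;  /* how many more writes will be refused */
    int writes_done;    /* how many writes were actually accepted */
};

/* Models pers->make_request: returns false if the write was not accepted. */
static bool model_make_request(struct model_dev *dev)
{
    if (dev->refusals_left > 0) {
        dev->refusals_left--;
        return false;
    }
    dev->writes_done++;
    return true;
}

/* Pattern of the normal path: check the result and retry until accepted.
 * (Real code would wait for the blocking condition to clear, not spin.) */
static void model_submit_checked(struct model_dev *dev)
{
    while (!model_make_request(dev))
        ;
}

/* Pattern of the flush path as described above: the result is ignored,
 * so a refused write is never resubmitted. */
static void model_submit_unchecked(struct model_dev *dev)
{
    (void)model_make_request(dev);
}
```

With a device that refuses one write, model_submit_unchecked leaves 
writes_done at 0, while model_submit_checked retries and ends with 
writes_done at 1. The first case corresponds to the lost write that the 
filesystem then waits on.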

I'm hitting this on a RHEL kernel, but looking at the upstream code it appears 
that the same thing could happen.

Not sure how best to deal with this... thank you for any advice!

Nate



2017-09-20 12:49 raid1 IO hang - md_submit_flush_data ignores md_write_start return value Nate Dailey
2017-09-20 19:35 ` Shaohua Li
2017-09-21 15:42   ` Nate Dailey
