From: Shaohua Li <shli@kernel.org>
To: Alexey Obitotskiy <aleksey.obitotskiy@intel.com>
Cc: linux-raid@vger.kernel.org
Subject: Re: [PATCH][RESEND] md: Prevent IO hold during accessing to faulty raid5 array
Date: Fri, 5 Aug 2016 22:01:01 -0700 [thread overview]
Message-ID: <20160806050101.GA53971@kernel.org> (raw)
In-Reply-To: <1470211376-25201-1-git-send-email-aleksey.obitotskiy@intel.com>
On Wed, Aug 03, 2016 at 10:02:56AM +0200, Alexey Obitotskiy wrote:
> After array enters in faulty state (e.g. number of failed drives
> becomes more then accepted for raid5 level) it sets error flags
> (one of this flags is MD_CHANGE_PENDING). For internal metadata
> arrays MD_CHANGE_PENDING cleared into md_update_sb, but not for
> external metadata arrays. MD_CHANGE_PENDING flag set prevents to
> finish all new or non-finished IOs to array and hold them in
> pending state. In some cases this can leads to deadlock situation.
>
> For example, we have faulty array (2 of 4 drives failed) and
> udev handle array state changes and blkid started (or other
> userspace application that used array to read/write) but unable
> to finish reads due to IO hold. At the same time we unable to get
> exclusive access to array (to stop array in our case) because
> another external application still use this array.
>
> Fix makes possible to return IO with errors immediately.
> So external application can finish working with array and
> give exclusive access to other applications to perform
> required management actions with array.
>
> Signed-off-by: Alexey Obitotskiy <aleksey.obitotskiy@intel.com>
> ---
> drivers/md/raid5.c | 4 +++-
> 1 file changed, 3 insertions(+), 1 deletion(-)
>
> diff --git a/drivers/md/raid5.c b/drivers/md/raid5.c
> index 6c1149d..99471b6 100644
> --- a/drivers/md/raid5.c
> +++ b/drivers/md/raid5.c
> @@ -4692,7 +4692,9 @@ finish:
> }
>
> if (!bio_list_empty(&s.return_bi)) {
> - if (test_bit(MD_CHANGE_PENDING, &conf->mddev->flags)) {
> + if (test_bit(MD_CHANGE_PENDING, &conf->mddev->flags) &&
> + (s.failed <= conf->max_degraded ||
> + conf->mddev->external == 0)) {
> spin_lock_irq(&conf->device_lock);
> bio_list_merge(&conf->return_bi, &s.return_bi);
> spin_unlock_irq(&conf->device_lock);
So the external metadata array will have the potential race Neil's patch
(c3cce6cda162eb) tried to fix. But we probably can't do too much for it.
Applied this one.
Thanks,
Shaohua
prev parent reply other threads:[~2016-08-06 5:01 UTC|newest]
Thread overview: 2+ messages / expand[flat|nested] mbox.gz Atom feed top
2016-08-03 8:02 [PATCH][RESEND] md: Prevent IO hold during accessing to faulty raid5 array Alexey Obitotskiy
2016-08-06 5:01 ` Shaohua Li [this message]
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20160806050101.GA53971@kernel.org \
--to=shli@kernel.org \
--cc=aleksey.obitotskiy@intel.com \
--cc=linux-raid@vger.kernel.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).