From: Eryu Guan <guaneryu@gmail.com>
To: Anand Jain <Anand.Jain@oracle.com>
Cc: linux-btrfs@vger.kernel.org
Subject: Re: [PATCH v2] Btrfs: return failure if btrfs_dev_replace_finishing() failed
Date: Tue, 14 Oct 2014 11:35:45 +0800 [thread overview]
Message-ID: <20141014033545.GL13950@dhcp-13-216.nay.redhat.com> (raw)
In-Reply-To: <543BA6DC.9060106@oracle.com>
On Mon, Oct 13, 2014 at 06:18:04PM +0800, Anand Jain wrote:
>
>
> On 10/13/14 14:59, Eryu Guan wrote:
> >On Mon, Oct 13, 2014 at 02:23:57PM +0800, Anand Jain wrote:
> >>
> >>
> >>comments below..
> >>
> >>
> >>On 10/13/14 12:42, Eryu Guan wrote:
> >>>device replace could fail due to another running scrub process or any
> >>>other errors btrfs_scrub_dev() may hit, but this failure doesn't get
> >>>returned to userspace.
> >>>
> >>>The following steps could reproduce this issue
> >>>
> >>> mkfs -t btrfs -f /dev/sdb1 /dev/sdb2
> >>> mount /dev/sdb1 /mnt/btrfs
> >>> while true; do btrfs scrub start -B /mnt/btrfs >/dev/null 2>&1; done &
> >>> btrfs replace start -Bf /dev/sdb2 /dev/sdb3 /mnt/btrfs
> >>> # if this replace succeeded, do the following and repeat until
> >>> # you see this log in dmesg
> >>> # BTRFS: btrfs_scrub_dev(/dev/sdb2, 2, /dev/sdb3) failed -115
> >>> #btrfs replace start -Bf /dev/sdb3 /dev/sdb2 /mnt/btrfs
> >>>
> >>> # once you see the error log in dmesg, check return value of
> >>> # replace
> >>> echo $?
> >>>
> >>>Introduce a new dev replace result
> >>>
> >>>BTRFS_IOCTL_DEV_REPLACE_RESULT_SCRUB_INPROGRESS
> >>>
> >>>to catch -EINPROGRESS explicitly and return other errors directly to
> >>>userspace.
> >>>
> >>>Signed-off-by: Eryu Guan <guaneryu@gmail.com>
> >>>---
> >>>
> >>>v2:
> >>>- set result to SCRUB_INPROGRESS if btrfs_scrub_dev returned -EINPROGRESS
> >>> and return 0 as Miao Xie suggested
> >>>
> >>> fs/btrfs/dev-replace.c | 12 +++++++++---
> >>> include/uapi/linux/btrfs.h | 1 +
> >>> 2 files changed, 10 insertions(+), 3 deletions(-)
> >>>
> >>>diff --git a/fs/btrfs/dev-replace.c b/fs/btrfs/dev-replace.c
> >>>index eea26e1..a141f8b 100644
> >>>--- a/fs/btrfs/dev-replace.c
> >>>+++ b/fs/btrfs/dev-replace.c
> >>>@@ -418,9 +418,15 @@ int btrfs_dev_replace_start(struct btrfs_root *root,
> >>> &dev_replace->scrub_progress, 0, 1);
> >>>
> >>> ret = btrfs_dev_replace_finishing(root->fs_info, ret);
> >>>- WARN_ON(ret);
> >>>+ /* don't warn if EINPROGRESS, someone else might be running scrub */
> >>>+ if (ret == -EINPROGRESS) {
> >>>+ args->result = BTRFS_IOCTL_DEV_REPLACE_RESULT_SCRUB_INPROGRESS;
> >>>+ ret = 0;
> >>>+ } else {
> >>>+ WARN_ON(ret);
> >>>+ }
>
>
> I am bit concerned, why these racing threads here aren't excluding
> each other using "mutually_exclusive_operation_running" ? as most
> of the other device operation thread does.
>
> Thanks, Anand
btrfs_ioctl_scrub() doesn't use mutually_exclusive_operation_running
as other device operations do, I'm not sure if it should(seems scrub
should do it too to me).
But I think that's a different problem from the one I'm trying to fix
here. The main purpose is to return error to userspace when
btrfs_scrub_dev() hit some error. Dealing with -EINPROGRESS is to
match the current behavior(replace and scrub could run at the same
time).
Thanks,
Eryu
>
> >> looks like was are trying to manage EINPROGRESS returned by
> >
> >Yes, that's right.
> >
> >> btrfs_dev_replace_finishing(). In btrfs_dev_replace_finishing()
> >> which specific func call is returning EINPROGRESS ? I didn't go
> >> deep enough.
> >
> >btrfs_dev_replace_finishing() will check the scrub_ret(the last
> >argument), and return scrub_ret if (!scrub_ret). It was returning 0
> >unconditionally before this patch.
> >
> >btrfs_dev_replace_start@fs/btrfs/dev-replace.c
> > 416 ret = btrfs_scrub_dev(fs_info, src_device->devid, 0,
> > 417 src_device->total_bytes,
> > 418 &dev_replace->scrub_progress, 0, 1);
> > 419
> > 420 ret = btrfs_dev_replace_finishing(root->fs_info, ret);
> >
> >and btrfs_dev_replace_finishing@fs/btrfs/dev-replace.c
> > 529 if (!scrub_ret) {
> > 530 btrfs_dev_replace_update_device_in_mapping_tree(fs_info,
> > 531 src_device,
> > 532 tgt_device);
> > 533 } else {
> >......
> > 547 return scrub_ret;
> > 548 }
>
>
>
>
>
> >>
> >> And how do we handle if replace is intervened by balance
> >> instead of scrub ?
> >
> >Based on my test, replace ioctl would return -ENOENT if balance is
> >running
> >
> >ERROR: ioctl(DEV_REPLACE_START) failed on "/mnt/testarea/scratch": No such file or directory, no error
> >
> >(I haven't gone through this codepath yet and don't know where -ENOENT
> >comes from, but I don't think it's a proper errno,
> >/mnt/testarea/scratch is definitely there)
> >>
> >> sorry if I missed something.
> >>
> >>Anand
> >
> >Thanks for the review!
> >
> >Eryu
> >>
> >>
> >>>- return 0;
> >>>+ return ret;
> >>>
> >>> leave:
> >>> dev_replace->srcdev = NULL;
> >>>@@ -538,7 +544,7 @@ static int btrfs_dev_replace_finishing(struct btrfs_fs_info *fs_info,
> >>> btrfs_destroy_dev_replace_tgtdev(fs_info, tgt_device);
> >>> mutex_unlock(&dev_replace->lock_finishing_cancel_unmount);
> >>>
> >>>- return 0;
> >>>+ return scrub_ret;
> >>> }
> >>>
> >>> printk_in_rcu(KERN_INFO
> >>>diff --git a/include/uapi/linux/btrfs.h b/include/uapi/linux/btrfs.h
> >>>index 2f47824..611e1c5 100644
> >>>--- a/include/uapi/linux/btrfs.h
> >>>+++ b/include/uapi/linux/btrfs.h
> >>>@@ -157,6 +157,7 @@ struct btrfs_ioctl_dev_replace_status_params {
> >>> #define BTRFS_IOCTL_DEV_REPLACE_RESULT_NO_ERROR 0
> >>> #define BTRFS_IOCTL_DEV_REPLACE_RESULT_NOT_STARTED 1
> >>> #define BTRFS_IOCTL_DEV_REPLACE_RESULT_ALREADY_STARTED 2
> >>>+#define BTRFS_IOCTL_DEV_REPLACE_RESULT_SCRUB_INPROGRESS 3
> >>> struct btrfs_ioctl_dev_replace_args {
> >>> __u64 cmd; /* in */
> >>> __u64 result; /* out */
> >>>
prev parent reply other threads:[~2014-10-14 3:35 UTC|newest]
Thread overview: 6+ messages / expand[flat|nested] mbox.gz Atom feed top
2014-10-13 4:42 [PATCH v2] Btrfs: return failure if btrfs_dev_replace_finishing() failed Eryu Guan
2014-10-13 4:42 ` [PATCH] btrfs-progs: add new dev replace result Eryu Guan
2014-10-13 6:23 ` [PATCH v2] Btrfs: return failure if btrfs_dev_replace_finishing() failed Anand Jain
2014-10-13 6:59 ` Eryu Guan
2014-10-13 10:18 ` Anand Jain
2014-10-14 3:35 ` Eryu Guan [this message]
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20141014033545.GL13950@dhcp-13-216.nay.redhat.com \
--to=guaneryu@gmail.com \
--cc=Anand.Jain@oracle.com \
--cc=linux-btrfs@vger.kernel.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).