From: Anand Jain <anand.jain@oracle.com>
To: dsterba@suse.cz, linux-btrfs@vger.kernel.org
Subject: Re: [PATCH v2 2/4] btrfs: fix race between free_stale_devices and close_fs_devices
Date: Wed, 8 Aug 2018 17:51:31 +0800 [thread overview]
Message-ID: <261bd905-7811-0acf-a609-5606a21d8b2e@oracle.com> (raw)
In-Reply-To: <20180807145948.GH3218@twin.jikos.cz>
On 08/07/2018 10:59 PM, David Sterba wrote:
> On Thu, Aug 02, 2018 at 05:29:12PM +0800, Anand Jain wrote:
>>
>>
>> On 08/01/2018 10:29 PM, David Sterba wrote:
>>> On Thu, Jul 26, 2018 at 02:53:32PM +0800, Anand Jain wrote:
>>>> From: Anand Jain <Anand.Jain@oracle.com>
>>>>
>>>> %fs_devices can be free-ed by btrfs_free_stale_devices() when the
>>>> close_fs_devices() drops fs_devices::opened to zero, but close_fs_devices
>>>> tries to access the %fs_devices again without the device_list_mutex.
>>>>
>>>> Fix this by bringing the %fs_devices access with in the device_list_mutex.
>>>
>>> AFAICS this cannot happen anymore because the two calls are serialized
>>> by the uuid_mutex. But this was not the case when syzbot reported the
>>> problem where your patch would apply.
>>>
>>> The parallell access to opened and device list cannot happen when:
>>>
>>> * btrfs_scan_one_device that wants to call btrfs_free_stale_devices
>>> * btrfs_close_devices calls close_fs_devices
>>>
>>> Fixed by the series:
>>>
>>> btrfs: lift uuid_mutex to callers of btrfs_scan_one_device
>>> btrfs: lift uuid_mutex to callers of btrfs_open_devices
>>> btrfs: lift uuid_mutex to callers of btrfs_parse_early_options
>>> btrfs: reorder initialization before the mount locks uuid_mutex
>>> btrfs: fix mount and ioctl device scan ioctl race
>>>
>>> If there's a race I don't see, please describe in more detail.
>>
>> Right. There is no race with the uuid_mutex patches as above.
>>
>> And I just found this- can we make close be consistent with its
>> open part.
>> btrfs_open_devices() hold device_list_mutex before the update to
>> fs_devices::opened. So close_fs_device() could do the same, and be
>> theoretically correct.
>
> Or it can be the other way around, to push the device_list_mutex only
> around the list_sort and open_fs_devices like:
> --- a/fs/btrfs/volumes.c
> +++ b/fs/btrfs/volumes.c
> @@ -1144,15 +1144,15 @@ int btrfs_open_devices(struct btrfs_fs_devices *fs_devices,
>
> lockdep_assert_held(&uuid_mutex);
>
> - mutex_lock(&fs_devices->device_list_mutex);
> if (fs_devices->opened) {
> fs_devices->opened++;
> ret = 0;
> } else {
> + mutex_lock(&fs_devices->device_list_mutex);
> list_sort(NULL, &fs_devices->devices, devid_cmp);
> ret = open_fs_devices(fs_devices, flags, holder);
> + mutex_unlock(&fs_devices->device_list_mutex);
> }
> - mutex_unlock(&fs_devices->device_list_mutex);
>
> return ret;
> }
>
Right. That it will be also true in terms of consistency. And I looked
around I don't find any reason why not.
Next question - btrfs_free_stale_devices() need a better way to check
if the device is opened by the FS. Currently it relays on the
fs_devices::opened.
And while doing that, both uuid_mutex and device_list_mutex are held
and since we are depend on uuid_mutex we don't need device_list_mutex.
uuid_mutex is too gross could stall operation on other FSID.
We can drop this patch. Do you want a patch to fix the
btrfs_open_devices() consistency?
Thanks, Anand
next prev parent reply other threads:[~2018-08-08 12:06 UTC|newest]
Thread overview: 23+ messages / expand[flat|nested] mbox.gz Atom feed top
2018-07-26 6:53 [PATCH v2 0/4] Misc volume patch set part2 Anand Jain
2018-07-26 6:53 ` [PATCH v4 1/4] btrfs: drop uuid_mutex in btrfs_free_extra_devids() Anand Jain
2018-07-26 6:53 ` [PATCH v2 2/4] btrfs: fix race between free_stale_devices and close_fs_devices Anand Jain
2018-08-01 14:29 ` David Sterba
2018-08-02 9:29 ` Anand Jain
2018-08-07 14:59 ` David Sterba
2018-08-08 9:51 ` Anand Jain [this message]
2018-07-26 6:53 ` [PATCH 3/4] btrfs: bug_on for num_devices below 0 Anand Jain
2018-07-26 6:53 ` [PATCH v2 4/4] btrfs: add helper btrfs_num_devices() to deduce num_devices Anand Jain
2018-08-01 14:41 ` David Sterba
2018-08-02 10:09 ` Anand Jain
2018-08-02 10:09 ` [PATCH v3 " Anand Jain
2018-08-02 10:11 ` Nikolay Borisov
2018-08-02 12:21 ` David Sterba
2018-08-02 13:07 ` Anand Jain
2018-08-07 15:02 ` David Sterba
2018-08-07 22:43 ` Anand Jain
2018-08-03 12:45 ` [PATCH v4 " Anand Jain
2018-08-03 12:45 ` [PATCH] btrfs: handle the BUG_ON in btrfs_num_devices() Anand Jain
2018-08-03 13:33 ` Nikolay Borisov
2018-08-06 8:57 ` Anand Jain
2018-08-07 17:09 ` David Sterba
2018-08-07 22:51 ` Anand Jain
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=261bd905-7811-0acf-a609-5606a21d8b2e@oracle.com \
--to=anand.jain@oracle.com \
--cc=dsterba@suse.cz \
--cc=linux-btrfs@vger.kernel.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).