From: Boris Burkov <boris@bur.io>
To: David Sterba <dsterba@suse.com>
Cc: linux-btrfs@vger.kernel.org, stable@vger.kernel.org
Subject: Re: [PATCH] btrfs: always scan a single device when mounted
Date: Tue, 6 Feb 2024 16:21:15 -0800 [thread overview]
Message-ID: <ZcLM+2mtKFaVUFsF@devvm12410.ftw0.facebook.com> (raw)
In-Reply-To: <20240205174340.30327-1-dsterba@suse.com>
On Mon, Feb 05, 2024 at 06:43:39PM +0100, David Sterba wrote:
> There are reports that since version 6.7 update-grub fails to find the
> device of the root on systems without initrd and on a single device.
>
> This looks like the device name changed in the output of
> /proc/self/mountinfo:
>
> 6.5-rc5 working
>
> 18 1 0:16 / / rw,noatime - btrfs /dev/sda8 ...
>
> 6.7 not working:
>
> 17 1 0:15 / / rw,noatime - btrfs /dev/root ...
>
> and "update-grub" shows this error:
>
> /usr/sbin/grub-probe: error: cannot find a device for / (is /dev mounted?)
>
> This looks like it's related to the device name, but grub-probe
> recognizes the "/dev/root" path and tries to find the underlying device.
> However there's a special case for some filesystems, for btrfs in
> particular.
>
> The generic root device detection heuristic is not done and it all
> relies on reading the device infos by a btrfs specific ioctl. This ioctl
> returns the device name as it was saved at the time of device scan (in
> this case it's /dev/root).
>
> The change in 6.7 for temp_fsid to allow several single device
> filesystem to exist with the same fsid (and transparently generate a new
> UUID at mount time) was to skip caching/registering such devices.
>
> This also skipped mounted device. One step of scanning is to check if
> the device name hasn't changed, and if yes then update the cached value.
>
> This broke the grub-probe as it always read the device /dev/root and
> couldn't find it in the system. A temporary workaround is to create a
> symlink but this does not survive reboot.
>
> The right fix is to allow updating the device path of a mounted
> filesystem even if this is a single device one. This does not affect the
> temp_fsid feature, the UUID of the mounted filesystem remains the same
> and the matching is based on device major:minor which is unique per
> mounted filesystem.
>
> As the main part of device scanning and list update is done in
> device_list_add() that handles all corner cases and locking, it is
> extended to take a parameter that tells it to do everything as before,
> except adding a new device entry.
>
> This covers the path when the device (that exists for all mounted
> devices) name changes, updating /dev/root to /dev/sdx. Any other single
> device with filesystem is skipped.
>
> Note that if a system is booted and initial mount is done on the
> /dev/root device, this will be the cached name of the device. Only after
> the command "btrfs device rescan" it will change as it triggers the
> rename.
>
> The fix was verified by users whose systems were affected.
>
> CC: stable@vger.kernel.org # 6.7+
> Fixes: bc27d6f0aa0e ("btrfs: scan but don't register device on single device filesystem")
> Bugzilla: https://bugzilla.kernel.org/show_bug.cgi?id=218353
> Link: https://lore.kernel.org/lkml/CAKLYgeJ1tUuqLcsquwuFqjDXPSJpEiokrWK2gisPKDZLs8Y2TQ@mail.gmail.com/
> Signed-off-by: David Sterba <dsterba@suse.com>
Reviewed-by: Boris Burkov <boris@bur.io>
> ---
> fs/btrfs/volumes.c | 30 ++++++++++++++----------------
> 1 file changed, 14 insertions(+), 16 deletions(-)
>
> diff --git a/fs/btrfs/volumes.c b/fs/btrfs/volumes.c
> index 474ab7ed65ea..f2c2f7ca5c3d 100644
> --- a/fs/btrfs/volumes.c
> +++ b/fs/btrfs/volumes.c
> @@ -738,6 +738,7 @@ static noinline struct btrfs_device *device_list_add(const char *path,
> bool same_fsid_diff_dev = false;
> bool has_metadata_uuid = (btrfs_super_incompat_flags(disk_super) &
> BTRFS_FEATURE_INCOMPAT_METADATA_UUID);
> + bool can_create_new = *new_device_added;
It took me quite a while to figure out all the intended logic with the
now in/out parameter. I think it's probably too cute? Why not just add
another parameter "can_create_new_device"? I think it feels kind of
weird on the caller side too, to set "new_device_added" to true, but
then still rely on it to actually get set to true.
Once I got past this, the logic made sense, so definitely don't block
yourself on this nit.
>
> if (btrfs_super_flags(disk_super) & BTRFS_SUPER_FLAG_CHANGING_FSID_V2) {
> btrfs_err(NULL,
> @@ -753,6 +754,7 @@ static noinline struct btrfs_device *device_list_add(const char *path,
> return ERR_PTR(error);
> }
>
> + *new_device_added = false;
> fs_devices = find_fsid_by_device(disk_super, path_devt, &same_fsid_diff_dev);
>
> if (!fs_devices) {
> @@ -804,6 +806,15 @@ static noinline struct btrfs_device *device_list_add(const char *path,
> return ERR_PTR(-EBUSY);
> }
>
> + if (!can_create_new) {
> + pr_info(
> + "BTRFS: device fsid %pU devid %llu transid %llu %s skip registration scanned by %s (%d)\n",
> + disk_super->fsid, devid, found_transid, path,
> + current->comm, task_pid_nr(current));
> + mutex_unlock(&fs_devices->device_list_mutex);
> + return NULL;
> + }
> +
> nofs_flag = memalloc_nofs_save();
> device = btrfs_alloc_device(NULL, &devid,
> disk_super->dev_item.uuid, path);
> @@ -1355,27 +1366,14 @@ struct btrfs_device *btrfs_scan_one_device(const char *path, blk_mode_t flags,
> goto error_bdev_put;
> }
>
> - if (!mount_arg_dev && btrfs_super_num_devices(disk_super) == 1 &&
> - !(btrfs_super_flags(disk_super) & BTRFS_SUPER_FLAG_SEEDING)) {
> - dev_t devt;
> -
> - ret = lookup_bdev(path, &devt);
> - if (ret)
> - btrfs_warn(NULL, "lookup bdev failed for path %s: %d",
> - path, ret);
> - else
> - btrfs_free_stale_devices(devt, NULL);
> -
> - pr_debug("BTRFS: skip registering single non-seed device %s\n", path);
> - device = NULL;
> - goto free_disk_super;
> - }
> + if (mount_arg_dev || btrfs_super_num_devices(disk_super) != 1 ||
> + (btrfs_super_flags(disk_super) & BTRFS_SUPER_FLAG_SEEDING))
> + new_device_added = true;
This is the line I was referring to in the comment above, fwiw.
>
> device = device_list_add(path, disk_super, &new_device_added);
> if (!IS_ERR(device) && new_device_added)
> btrfs_free_stale_devices(device->devt, device);
>
> -free_disk_super:
> btrfs_release_disk_super(disk_super);
>
> error_bdev_put:
>
> --
> 2.42.1
>
next prev parent reply other threads:[~2024-02-07 0:21 UTC|newest]
Thread overview: 5+ messages / expand[flat|nested] mbox.gz Atom feed top
2024-02-05 17:43 [PATCH] btrfs: always scan a single device when mounted David Sterba
2024-02-07 0:21 ` Boris Burkov [this message]
2024-02-07 9:28 ` David Sterba
2024-02-07 5:14 ` Anand Jain
2024-02-07 9:24 ` David Sterba
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=ZcLM+2mtKFaVUFsF@devvm12410.ftw0.facebook.com \
--to=boris@bur.io \
--cc=dsterba@suse.com \
--cc=linux-btrfs@vger.kernel.org \
--cc=stable@vger.kernel.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox