From: Anand Jain <anajain.sg@gmail.com>
To: Theodore Tso <tytso@mit.edu>
Cc: Christoph Hellwig <hch@infradead.org>,
"Darrick J. Wong" <djwong@kernel.org>,
linux-ext4@vger.kernel.org, linux-btrfs@vger.kernel.org,
linux-xfs@vger.kernel.org, Anand Jain <asj@kernel.org>,
dsterba@suse.com
Subject: Re: [PATCH v2 3/3] ext4: derive f_fsid from block device to avoid collisions
Date: Thu, 16 Apr 2026 23:21:41 +0800 [thread overview]
Message-ID: <b593ab17-afaf-4128-97eb-0ab9c23dec5c@gmail.com> (raw)
In-Reply-To: <20260409131238.GC18443@macsyma-wired.lan>
On 9/4/26 21:12, Theodore Tso wrote:
> On Thu, Apr 09, 2026 at 05:45:24PM +0800, Anand Jain wrote:
>>
>> Got it. Do you mean that since both filesystems are identical,
>> statfs(A) and statfs(B) can legitimately return the same values?
>
> Yes. f_Fsid can legitimately always be zero (which I believe is the
> case for FreeBSD, but I understand that there are some programs, like
> systemd, which subscribe to the heresy, "All the World's Linux", which
> is a variant of the "All the World's a Vax" or "All the World's SunOS"
> at the beginning of my career :-).
>
>> I'm not entirely sure what the correct expectation for f_fsid
>> should be.
>
> That's my point, there *is* no correct expectation, and I don't
> believe there can or should be. What we should be doing instead is
> actively discouraging people from using f_fsid. I suspect that's one
> of the reasons why FreeBSD may have chosen to just return zero.
>
> Which is why I don't think we should be testing this in xfstests's
> generic/791, either. (Unless we get consensus across file system
> developers abnd willing to make it be a documented behavior as of a
> particular kernel version, and we then adjust the test to skip it if
> it's older than that kernel version, so it doesn't break LTS kernel
> tests. See below....)
>
Yes, the idea for generic/79[0-5] was really just to make sure
we don't accidentally change s_uuid or f_fsid behavior without
realizing it. It gives us a baseline for current and LTS kernels
if f_fsid/s_uuid is changed. (Some of the submitted test cases
may still need revision).
>> My initial idea was to make f_fsid behavior consistent across
>> major filesystems so that user space benefits from predictable
>> semantics.
>
> I'm OK with that, so long as it's unconditional across all file system
> types (ideally) or unconditionally across all major file systems (xfs,
> btrfs, ext4, f2fs) as of a particular kernel version (which is
> probably much more realistic), *and* it is documented in the Linux man
> pages as this is the standard behavior starting with 7.1 (or
> whatever), and that the man page further cautions that programs that
> expect to be portable to other OS's (MacOS, FreeBSD, Solaris, etc.)
> should not count on this behavior.
>
On second thought, we should perhaps consider a more robust ID,
let's call it `f_fsid_v2`. More on `f_fsid_v2` below.
> But given that you originally stumbled across this with Overlayfs,
> because it was originally using s_uuid, and that didn't work well for
> btrfs, why not change overlayfs to just use s_uuid plus kdev_t in its
> xattr, and just fix the problem for overlayfs? That has the benefit
> that it will work for all file system types in Linux, not just for
> those where we have changed what f_fsid does.
Using `kdev_t` (or any derivation of it) for persistent storage, such
as Overlayfs xattrs, is problematic. Since `kdev_t` is transient and
inconsistent across reboots or device re-discovery, it could lead to
broken associations.
It seems we've reached the functional limits of f_fsid.
If we want to solve this properly for Overlayfs, NFS handles, or a
complex system monitoring..etc, we need a new identifier let's call
it f_fsid_v2, that meets the following requirements:
System-wide Uniqueness: Must distinguish between cloned filesystems.
Persistence: Must remain consistent across reboots/HW re-enumeration.
Non-On-Disk: Must not be stored on-disk.
One possible implementation for f_fsid_v2 could be:
f_fsid_v2 = hash(s_uuid, block_device_serial, [subvol_id])
For pseudo block devices (virtio-blk, loop, nbd, brd,..),
the serial could be derived recursively:
serial_number = hash(backing_file.f_fsid_v2, backing_file.ino)
Note on Hardware Serials:
Standard storage protocols (T10, NVMe, SAS) mandate unique,
persistent serials per LUN. While I've seen T10 protocol
violations during my time authoring Solaris HBA drivers, I
believe these outliers shouldn't dictate the design.
This approach provides a system-wide unique ID that is persistent
without being stored on-disk. Effectively solving the cloned
filesystem identity crisis.
Thoughts ?
Thanks, Anand
next prev parent reply other threads:[~2026-04-16 15:21 UTC|newest]
Thread overview: 24+ messages / expand[flat|nested] mbox.gz Atom feed top
2026-03-21 11:55 [PATCH v2 0/3] fix s_uuid and f_fsid consistency for cloned filesystems Anand Jain
2026-03-21 11:55 ` [PATCH v2 1/3] btrfs: use on-disk uuid for s_uuid in temp_fsid mounts Anand Jain
2026-03-21 11:55 ` [PATCH v2 2/3] btrfs: derive f_fsid from on-disk fsuuid and dev_t Anand Jain
2026-03-21 11:55 ` [PATCH v2 3/3] ext4: derive f_fsid from block device to avoid collisions Anand Jain
2026-03-23 4:16 ` Theodore Tso
2026-03-23 15:29 ` Darrick J. Wong
2026-03-23 16:44 ` Darrick J. Wong
2026-03-25 10:02 ` Andreas Dilger
2026-03-25 10:59 ` Anand Jain
2026-03-25 12:59 ` Theodore Tso
2026-04-02 7:33 ` Anand Jain
2026-03-23 15:41 ` Anand Jain
2026-04-04 8:59 ` Anand Jain
2026-04-07 5:22 ` Christoph Hellwig
2026-04-07 14:47 ` Theodore Tso
2026-04-08 22:28 ` Anand Jain
2026-04-09 4:10 ` Theodore Tso
2026-04-09 9:45 ` Anand Jain
2026-04-09 13:12 ` Theodore Tso
2026-04-16 15:21 ` Anand Jain [this message]
2026-04-17 7:34 ` Christoph Hellwig
2026-04-22 11:39 ` Anand Jain
2026-04-23 5:08 ` Christoph Hellwig
2026-04-27 10:16 ` Anand Jain
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=b593ab17-afaf-4128-97eb-0ab9c23dec5c@gmail.com \
--to=anajain.sg@gmail.com \
--cc=asj@kernel.org \
--cc=djwong@kernel.org \
--cc=dsterba@suse.com \
--cc=hch@infradead.org \
--cc=linux-btrfs@vger.kernel.org \
--cc=linux-ext4@vger.kernel.org \
--cc=linux-xfs@vger.kernel.org \
--cc=tytso@mit.edu \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.