From: Anand Jain <anajain.sg@gmail.com>
To: Theodore Tso <tytso@mit.edu>,
Christoph Hellwig <hch@infradead.org>,
"Darrick J. Wong" <djwong@kernel.org>
Cc: linux-ext4@vger.kernel.org, linux-btrfs@vger.kernel.org,
linux-xfs@vger.kernel.org, Anand Jain <asj@kernel.org>
Subject: Re: [PATCH v2 3/3] ext4: derive f_fsid from block device to avoid collisions
Date: Sat, 4 Apr 2026 16:59:08 +0800 [thread overview]
Message-ID: <5bda3d00-df35-4ea1-b313-2fef6e5c5682@gmail.com> (raw)
In-Reply-To: <d4a9970b-e7ed-4e74-be9d-2d08400f9d79@gmail.com>
Hi Ted, Christoph, Darrick,
As I prepare v3, I'd appreciate your final thoughts on the mount option
naming and its necessity for ext4.
For the new option, I am considering:
-o nodup_f_fsid
-o unique_f_fsid
Context:
Currently, ext4's f_fsid is consistent across reboots but fails to be
unique when dealing with cloned filesystems (sharing the same UUID). Per
statfs(2) [1], the primary requirement is that the (f_fsid, ino) pair
uniquely identifies a file. The man page makes no explicit guarantee
regarding consistency across mount cycles or reboots.
Proposal:
With this fix, f_fsid becomes f(uuid, dev_t). This ensures OS-wide
uniqueness and maintains consistency as long as the underlying dev_t
remains stable.
Dilemma:
While statfs(2) [1] suggests f_fsid is "some random stuff," we know
userspace (NFS, systemd) often treats it as a persistent handle.
Do you prefer one of the names above, or is there a more idiomatic ext4
naming convention I should follow?
Given the ambiguity in the man page, is gating this behind an -o option
necessary, or should we consider making uniqueness the default behavior?
[1]
----------
statfs(2)
<snap>
Nobody knows what f_fsid is supposed to contain (but see below).
<snap>
The f_fsid field
Solaris, Irix, and POSIX have a system call statvfs(2)
that returns a struct statvfs (defined in <sys/statvfs.h>) containing
an unsigned long f_fsid. Linux, SunOS, HP-UX, 4.4BSD have a system
call statfs() that returns a struct statfs (defined in <sys/vfs.h>)
containing a fsid_t f_fsid, where fsid_t is defined as struct { int
val[2]; }. The same holds for FreeBSD, except that it uses the
include file <sys/mount.h>.
The general idea is that f_fsid contains some random stuff such
that the pair (f_fsid,ino) uniquely determines a file. Some operating
systems use (a variation on) the device number, or the device number
combined with the filesystem type. Several operating systems restrict
giving out the f_fsid field to the superuser only (and zero it for
unprivileged users), because this field is used in the filehandle of
the filesystem when NFS-exported, and giving it out is a security concern.
Under some operating systems, the fsid can be used as the second
argument to the sysfs(2) system call.
----------
Thanks, Anand
next prev parent reply other threads:[~2026-04-04 8:59 UTC|newest]
Thread overview: 24+ messages / expand[flat|nested] mbox.gz Atom feed top
2026-03-21 11:55 [PATCH v2 0/3] fix s_uuid and f_fsid consistency for cloned filesystems Anand Jain
2026-03-21 11:55 ` [PATCH v2 1/3] btrfs: use on-disk uuid for s_uuid in temp_fsid mounts Anand Jain
2026-03-21 11:55 ` [PATCH v2 2/3] btrfs: derive f_fsid from on-disk fsuuid and dev_t Anand Jain
2026-03-21 11:55 ` [PATCH v2 3/3] ext4: derive f_fsid from block device to avoid collisions Anand Jain
2026-03-23 4:16 ` Theodore Tso
2026-03-23 15:29 ` Darrick J. Wong
2026-03-23 16:44 ` Darrick J. Wong
2026-03-25 10:02 ` Andreas Dilger
2026-03-25 10:59 ` Anand Jain
2026-03-25 12:59 ` Theodore Tso
2026-04-02 7:33 ` Anand Jain
2026-03-23 15:41 ` Anand Jain
2026-04-04 8:59 ` Anand Jain [this message]
2026-04-07 5:22 ` Christoph Hellwig
2026-04-07 14:47 ` Theodore Tso
2026-04-08 22:28 ` Anand Jain
2026-04-09 4:10 ` Theodore Tso
2026-04-09 9:45 ` Anand Jain
2026-04-09 13:12 ` Theodore Tso
2026-04-16 15:21 ` Anand Jain
2026-04-17 7:34 ` Christoph Hellwig
2026-04-22 11:39 ` Anand Jain
2026-04-23 5:08 ` Christoph Hellwig
2026-04-27 10:16 ` Anand Jain
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=5bda3d00-df35-4ea1-b313-2fef6e5c5682@gmail.com \
--to=anajain.sg@gmail.com \
--cc=asj@kernel.org \
--cc=djwong@kernel.org \
--cc=hch@infradead.org \
--cc=linux-btrfs@vger.kernel.org \
--cc=linux-ext4@vger.kernel.org \
--cc=linux-xfs@vger.kernel.org \
--cc=tytso@mit.edu \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.