From: Luis Henriques <luis@igalia.com>
To: Ilya Dryomov <idryomov@gmail.com>
Cc: Viacheslav Dubeyko <Slava.Dubeyko@ibm.com>,
"ceph-devel@vger.kernel.org" <ceph-devel@vger.kernel.org>,
Xiubo Li <xiubli@redhat.com>,
"pengpeng@iscas.ac.cn" <pengpeng@iscas.ac.cn>,
"slava@dubeyko.com" <slava@dubeyko.com>,
Alex Markuze <amarkuze@redhat.com>,
"linux-kernel@vger.kernel.org" <linux-kernel@vger.kernel.org>
Subject: Re: [PATCH v4] ceph: bound encrypted snapshot suffix formatting
Date: Mon, 27 Apr 2026 09:12:42 +0100 [thread overview]
Message-ID: <87qzo0keit.fsf@igalia.com> (raw)
In-Reply-To: <CAOi1vP8AC3eKeW7rbNmmZdt8aPAhf9AuCEBxbzYDkfn4Mq76gg@mail.gmail.com> (Ilya Dryomov's message of "Fri, 24 Apr 2026 21:20:23 +0200")
On Fri, Apr 24 2026, Ilya Dryomov wrote:
> On Fri, Apr 24, 2026 at 8:31 PM Viacheslav Dubeyko
> <Slava.Dubeyko@ibm.com> wrote:
>>
>> On Fri, 2026-04-24 at 11:27 +0200, Ilya Dryomov wrote:
>> > On Thu, Apr 23, 2026 at 8:04 PM Viacheslav Dubeyko
>> > <Slava.Dubeyko@ibm.com> wrote:
>> > >
>> > > On Wed, 2026-04-22 at 11:53 +0200, Ilya Dryomov wrote:
>> > > > On Fri, Apr 10, 2026 at 10:46 PM Viacheslav Dubeyko
>> > > > <Slava.Dubeyko@ibm.com> wrote:
>> > > > >
>> > > > > On Fri, 2026-04-10 at 20:40 +0000, Viacheslav Dubeyko wrote:
>> > > > > > On Thu, 2026-04-09 at 18:09 +0000, Viacheslav Dubeyko wrote:
>> > > > > > > On Thu, 2026-04-09 at 10:39 +0800, Pengpeng Hou wrote:
>> > > > > > > > ceph_encode_encrypted_dname() base64-encodes the encrypted snapshot
>> > > > > > > > name into the caller buffer and then, for long snapshot names, appends
>> > > > > > > > _<ino> with sprintf(p + elen, ...).
>> > > > > > > >
>> > > > > > > > Some callers only provide NAME_MAX bytes. For long snapshot names, a
>> > > > > > > > large inode suffix can push the final encoded name past NAME_MAX even
>> > > > > > > > though the encrypted prefix stayed within the documented 240-byte
>> > > > > > > > budget.
>> > > > > > > >
>> > > > > > > > Format the suffix into a small local buffer first and reject names
>> > > > > > > > whose suffix would exceed the caller's NAME_MAX output buffer.
>> > > > > > > >
>> > > > > > > > Signed-off-by: Pengpeng Hou <pengpeng@iscas.ac.cn>
>> > > > > > > > ---
>> > > > > > > > Changes since v3:
>> > > > > > > > - reject `elen > 240` explicitly instead of relying only on the earlier
>> > > > > > > > `WARN_ON()`
>> > > > > > > > - rewrite the NAME_MAX bound check in terms of the final total length
>> > > > > > > > instead of `NAME_MAX - prefix_len - elen`
>> > > > > > > >
>> > > > > > > > fs/ceph/crypto.c | 31 +++++++++++++++++++++++++++++--
>> > > > > > > > 1 file changed, 29 insertions(+), 2 deletions(-)
>> > > > > > > >
>> > > > > > > > diff --git a/fs/ceph/crypto.c b/fs/ceph/crypto.c
>> > > > > > > > index f3de43ccb470..42e3fff34697 100644
>> > > > > > > > --- a/fs/ceph/crypto.c
>> > > > > > > > +++ b/fs/ceph/crypto.c
>> > > > > > > > @@ -15,6 +15,12 @@
>> > > > > > > > #include "mds_client.h"
>> > > > > > > > #include "crypto.h"
>> > > > > > > >
>> > > > > > > > +/*
>> > > > > > > > + * Reserve room for '_' + decimal 64-bit inode number + trailing NUL.
>> > > > > > > > + * ceph_encode_encrypted_dname() copies only the visible suffix bytes.
>> > > > > > > > + */
>> > > > > > > > +#define CEPH_ENCRYPTED_SNAP_INO_SUFFIX_MAX sizeof("_18446744073709551615")
>> > > > > > > > +
>> > > > > > > > static int ceph_crypt_get_context(struct inode *inode, void *ctx, size_t len)
>> > > > > > > > {
>> > > > > > > > struct ceph_inode_info *ci = ceph_inode(inode);
>> > > > > > > > @@ -209,6 +215,7 @@ int ceph_encode_encrypted_dname(struct inode *parent, char *buf, int elen)
>> > > > > > > > struct inode *dir = parent;
>> > > > > > > > char *p = buf;
>> > > > > > > > u32 len;
>> > > > > > > > + int prefix_len = 0;
>> > > > > > > > int name_len = elen;
>> > > > > > > > int ret;
>> > > > > > > > u8 *cryptbuf = NULL;
>> > > > > > > > @@ -219,6 +226,7 @@ int ceph_encode_encrypted_dname(struct inode *parent, char *buf, int elen)
>> > > > > > > > if (IS_ERR(dir))
>> > > > > > > > return PTR_ERR(dir);
>> > > > > > > > p++; /* skip initial '_' */
>> > > > > > > > + prefix_len = 1;
>> > > > > > > > }
>> > > > > > > >
>> > > > > > > > if (!fscrypt_has_encryption_key(dir))
>> > > > > > > > @@ -271,8 +279,27 @@ int ceph_encode_encrypted_dname(struct inode *parent, char *buf, int elen)
>> > > > > > > >
>> > > > > > > > /* To understand the 240 limit, see CEPH_NOHASH_NAME_MAX comments */
>> > > > > > > > WARN_ON(elen > 240);
>> > > > > > > > - if (dir != parent) // leading _ is already there; append _<inum>
>> > > > > > > > - elen += 1 + sprintf(p + elen, "_%ld", dir->i_ino);
>> > > > > > > > + if (elen > 240) {
>> > > > > > > > + elen = -ENAMETOOLONG;
>> > > > > > > > + goto out;
>> > > > > > > > + }
>> > > > > > > > +
>> > > > > > > > + if (dir != parent) {
>> > > > > > > > + int total_len;
>> > > > > > > > + /* leading '_' is already there; append _<inum> */
>> > > > > > > > + char suffix[CEPH_ENCRYPTED_SNAP_INO_SUFFIX_MAX];
>> > > > > > > > +
>> > > > > > > > + ret = snprintf(suffix, sizeof(suffix), "_%lu", dir->i_ino);
>> > > > > > > > + total_len = prefix_len + elen + ret;
>> > > > > > > > + if (total_len > NAME_MAX) {
>> > > > > > > > + elen = -ENAMETOOLONG;
>> > > > > > > > + goto out;
>> > > > > > > > + }
>> > > > > > > > +
>> > > > > > > > + memcpy(p + elen, suffix, ret);
>> > > > > > > > + /* Include the leading '_' skipped by p. */
>> > > > > > > > + elen = total_len;
>> > > > > > > > + }
>> > > > > > > >
>> > > > > > > > out:
>> > > > > > > > kfree(cryptbuf);
>> > > > > > >
>> > > > > > > Looks good.
>> > > > > > >
>> > > > > > > Reviewed-by: Viacheslav Dubeyko <Slava.Dubeyko@ibm.com>
>> > > > > > >
>> > > > > > > Let me run xfstests for the patch to double check that everything is OK. I'll
>> > > > > > > share the result ASAP.
>> > > > > > >
>> > > > > >
>> > > > > > The xfstests run was successful. I don't see any issues with the patch.
>> > > > > >
>> > > > > > Tested-by: Viacheslav Dubeyko <Slava.Dubeyko@ibm.com>
>> > > > > >
>> > > > > >
>> > > > >
>> > > > > Applied on testing branch of CephFS kernel client git tree.
>> > > >
>> > > > Hi Pengpeng, Slava,
>> > > >
>> > > > This patch raised my attention because my understanding was that the
>> > > > entire CEPH_NOHASH_NAME_MAX + sha256() was put in place precisely to
>> > > > handle longer names nicely and make them fit into NAME_MAX-sized buffer.
>> > > > Simply rejecting longer names seemed to be in direct contradiction with
>> > > > that and yet the patch on its own was clearly merited given
>> > > >
>> > > > * (240 bytes is the maximum size allowed for snapshot names to take into
>> > > > * account the format: '_<SNAPSHOT-NAME>_<INODE-NUMBER>'.)
>> > > >
>> > > > comment on CEPH_NOHASH_NAME_MAX definition.
>> > > >
>> > > > I dug a bit deeper and started a discussion in [1]. The preliminary
>> > > > conclusion is that the 240 bytes assumption was a mistake -- somehow
>> > > > the minimum number of characters needed for <inum> ended up being used
>> > > > instead of the maximum. CEPH_NOHASH_NAME_MAX value is likely incorrect
>> > > > and should have been smaller -- something along the lines of 174 -
>> > > > SHA256_DIGEST_SIZE instead of 180 - SHA256_DIGEST_SIZE.
>> > > >
>> > >
>> > > The limitation could be 240 or bigger one, but anyway we need to process this
>> > > limitation in proper way. And this patch has exactly this goal. Is 240 bytes
>> > > limitation your concern? As far as I can see, this patch doesn't introduce this
>> > > limitation. It was there before this modification. We can extend this limitation
>> > > anytime.
>> >
>> > Hi Slava,
>> >
>> > My take on this is that there shouldn't be a limitation to begin with
>> > (other than NAME_MAX which is natural and applies universally, not just
>> > to snapshot names). This function has a bunch of code that is there
>> > specifically to handle longer names and avoid any artificial limits:
>> > the part of the name that spills over CEPH_NOHASH_NAME_MAX is hashed
>> > and the whole thing is set up in such a way that the end result is
>> > never bigger than 240 bytes. 240 isn't a random number -- it was
>> > picked on purpose to leave room for _ prefix and _<INODE-NUMBER> suffix
>> > (255 1 - 1 - 13) but a mistake appears to have crept in. Instead of
>> > accounting for the maximum possible <INODE-NUMBER> length (which is 20
>> > in decimal encoding), it accounted only for 13 (likely because it
>> > happens to be the maximum possible length in a single-MDS setup?).
>> >
>> > IMO the right course of action here would be to see if the hashing
>> > parameters can be adjusted, not introduce new ENAMETOOLONG errors.
>> >
>> >
>>
>> Hi Pengpeng,
>>
>> Could you please rework the patch taking into account the shared remarks?
>
> I would hold on until the discussion in [1] comes to conclusion. It
> turns out that the userspace client doesn't encrypt snapshot names
> anymore, so another (much worse, at least IMO) route would be to drop
> this bit of functionality from the kernel client as well [2].
OK, let me see if I understood this correctly:
- The user-space client implementation leaks metadata (the snaphsot names)
- The kernel client doesn't leak the snapshots names, thought there are
bugs to be fixed (including on the MDS side)
- The proposed solution is to drop the kernel snapshot names encryption.
So, currently snapshots created on the kernel client can't be accessed
from the user-space client, and vice-versa?
Also, won't dropping the encryption from the kernel client effectively
make old snapshots created using a kernel client unusable?
Cheers,
--
Luís
>
> [1] https://github.com/ceph/ceph/pull/45312
> [2] https://tracker.ceph.com/issues/76257
>
> Thanks,
>
> Ilya
next prev parent reply other threads:[~2026-04-27 8:12 UTC|newest]
Thread overview: 20+ messages / expand[flat|nested] mbox.gz Atom feed top
2026-04-03 8:56 [PATCH] ceph: bound encrypted snapshot suffix formatting Pengpeng Hou
2026-04-06 18:52 ` Viacheslav Dubeyko
2026-04-07 1:57 ` [PATCH v2] " Pengpeng Hou
2026-04-07 19:42 ` Viacheslav Dubeyko
2026-04-08 0:57 ` [PATCH v3] " Pengpeng Hou
2026-04-08 18:34 ` Viacheslav Dubeyko
2026-04-09 2:39 ` [PATCH v4] " Pengpeng Hou
2026-04-09 18:09 ` Viacheslav Dubeyko
2026-04-10 20:40 ` Viacheslav Dubeyko
2026-04-10 20:46 ` Viacheslav Dubeyko
2026-04-22 9:53 ` Ilya Dryomov
2026-04-23 18:04 ` Viacheslav Dubeyko
2026-04-24 9:27 ` Ilya Dryomov
2026-04-24 18:31 ` Viacheslav Dubeyko
2026-04-24 19:20 ` Ilya Dryomov
2026-04-27 8:12 ` Luis Henriques [this message]
2026-04-28 11:40 ` Ilya Dryomov
2026-04-28 18:17 ` Viacheslav Dubeyko
2026-04-24 8:15 ` Luis Henriques
2026-04-07 3:30 ` [PATCH] " Pengpeng Hou
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=87qzo0keit.fsf@igalia.com \
--to=luis@igalia.com \
--cc=Slava.Dubeyko@ibm.com \
--cc=amarkuze@redhat.com \
--cc=ceph-devel@vger.kernel.org \
--cc=idryomov@gmail.com \
--cc=linux-kernel@vger.kernel.org \
--cc=pengpeng@iscas.ac.cn \
--cc=slava@dubeyko.com \
--cc=xiubli@redhat.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox