Linux filesystem development
 help / color / mirror / Atom feed
From: David Laight <david.laight.linux@gmail.com>
To: Viacheslav Dubeyko <slava@dubeyko.com>
Cc: Kees Cook <kees@kernel.org>,
	linux-hardening@vger.kernel.org, linux-fsdevel@vger.kernel.org,
	linux-kernel@vger.kernel.org, Arnd Bergmann <arnd@kernel.org>,
	John Paul Adrian Glaubitz <glaubitz@physik.fu-berlin.de>,
	Yangtao Li <frank.li@vivo.com>
Subject: Re: [PATCH next] fs/hfsplus/xattr: Use memcpy() and strscpy() to build xattr_name
Date: Thu, 11 Jun 2026 09:09:29 +0100	[thread overview]
Message-ID: <20260611090929.54e388c5@pumpkin> (raw)
In-Reply-To: <5572678c0db875297d80a1401ed2567326bf50ad.camel@dubeyko.com>

On Wed, 10 Jun 2026 18:05:02 -0700
Viacheslav Dubeyko <slava@dubeyko.com> wrote:

> On Wed, 2026-06-10 at 10:09 +0100, David Laight wrote:
> > On Tue, 09 Jun 2026 18:04:46 -0700
> > Viacheslav Dubeyko <slava@dubeyko.com> wrote:
> >   
> > > On Mon, 2026-06-08 at 10:55 +0100,
> > > david.laight.linux@gmail.com wrote:  
> > > > From: David Laight <david.laight.linux@gmail.com>
> > > > 
> > > > xattr_name is kmalloc()ed at the (assumed) maximal size and then
> > > > the
> > > > prefix
> > > > and name concatenated together.
> > > > Use memcpy() for the prefix - its length is passed and strscpy()
> > > > for
> > > > the
> > > > name to ensure it really doesnt overflow.
> > > > 
> > > > Prior to bf29e886b242c the buffers were smaller and on-stack.
> > > > (But I cant see the copy in the old code.)
> > > > I am also not sure why the buffer isnt created "just long
> > > > enough".    
> > > 
> > > What do you mean by "the buffer isn't created just long enough"?
> > > And
> > > what is your vision of correct implementation of the logic?  
> > 
> > The old code is:  
> > >  	xattr_name = kmalloc(xattr_name_len, GFP_KERNEL);
> > >  	if (!xattr_name)
> > >  		return -ENOMEM;
> > > 	strcpy(xattr_name, prefix);
> > > 	strcpy(xattr_name + prefixlen, name);  
> > 
> > It would be more usual to do:
> > 	size_t name_len = strlen(name) + 1;
> > 	xattr_name = kmalloc(prefixlen + name_len);
> > 	memcpy(xattr_name, prefix, prefixlen)
> > 	memcpy(xattr_name + prefixlen, name, name_len);
> > So that the buffer is allocated the correct size for the strings.
> > 
> > (Not that it really makes much difference whether you do kmalloc(32)
> > or
> > kmalloc(1024) for a temporary buffer - both come of a per-cpu list.)  
> 
> The xattr name cannot contain more than 127 symbols [1]. So, the buffer
> of fixed size is allocated to keep any potential string inside of this
> limit no matter of size of the prefix and name. This is because we must
> have string that not longer that 127 symbols. If it is longer than
> that, then something is going wrong. Following to your logic, there is
> no big difference to allocate the buffer for 127 symbols or for precise
> length of the name. So, your suggestion doesn't make sense to me.

This code has no idea of the number of symbols.
Assuming that the callers haven't passed something that is overlong
is just asking for trouble - especially since it doesn't matter to
this code.
The caller is unlikely to know the length of the prefix - so may be
passing in a 'name' that is nearly 127 symbols long.
So adding the prefix can generate something longer than 127 symbols.
The called code has to handle/detect that.

> 
> > 
> > What I couldn't decide, but seems to be inferred from some of fixes,
> > was whether a double-terminator was needed.
> > (That might just be the attribute value.)
> > One of the related functions got fixed to zero fill a buffer in a
> > loop
> > because (AFICT) something was reading beyond a '\0' terminator.  
> 
> The rest is dedicated to Unicode peculiarities, as far as I can see.

Eh? a UTF-8 encoded buffer is terminated by a single '\0'.
The changes that stopped 'old values' being used on a later loop
iteration seems to be fixed by zeroing after the '\0', but neither
the code change nor commit message makes it clear what was fixed.

> 
> > 
> > This code does seem strange though.
> > It seems to add a text prefix here and then the called function does
> > string compares to find out which prefix has added and than act
> > differently based on the prefix.
> > I didn't try to follow things any further.  
> 
> There is no strange thing here. Different prefixes are processed by
> different logic.

So why do string compares for the prefixes?

> 
> > 
> > I also don't know if 'name' itself can be 127 wide characters (before
> > the
> > prefix is added)?
> > The fixed length buffer isn't long enough if all 127 characters
> > require
> > 6 bytes to encode.  
> 
> The whole string cannot be bigger than 127 symbols.

'cannot' or 'must not'?
As I said above where is the actual guarantee that the limit isn't exceeded.

	David

> 
> Thanks,
> Slava.
> 
> [1] https://elixir.bootlin.com/linux/v7.1-rc6/source/include/linux/hfs_common.h#L78


  reply	other threads:[~2026-06-11  8:09 UTC|newest]

Thread overview: 8+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2026-06-08  9:55 [PATCH next] fs/hfsplus/xattr: Use memcpy() and strscpy() to build xattr_name david.laight.linux
2026-06-10  1:04 ` Viacheslav Dubeyko
2026-06-10  9:09   ` David Laight
2026-06-11  1:05     ` Viacheslav Dubeyko
2026-06-11  8:09       ` David Laight [this message]
2026-06-11  3:50 ` Viacheslav Dubeyko
2026-06-11  4:18   ` Darrick J. Wong
2026-06-11  8:18   ` David Laight

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20260611090929.54e388c5@pumpkin \
    --to=david.laight.linux@gmail.com \
    --cc=arnd@kernel.org \
    --cc=frank.li@vivo.com \
    --cc=glaubitz@physik.fu-berlin.de \
    --cc=kees@kernel.org \
    --cc=linux-fsdevel@vger.kernel.org \
    --cc=linux-hardening@vger.kernel.org \
    --cc=linux-kernel@vger.kernel.org \
    --cc=slava@dubeyko.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox