* [PATCH] ext4: Refactor breaking condition for xattr_find_entry()
@ 2025-07-08 2:00 I Hsin Cheng
2025-07-08 2:24 ` Theodore Ts'o
2025-07-19 21:45 ` Theodore Ts'o
0 siblings, 2 replies; 4+ messages in thread
From: I Hsin Cheng @ 2025-07-08 2:00 UTC (permalink / raw)
To: tytso
Cc: adilger.kernel, linux-ext4, linux-kernel, skhan,
linux-kernel-mentees, I Hsin Cheng
Refactor the condition for breaking the loop within xattr_find_entry().
Eliminate the use of "<=" and short-circuit the condition when "!cmp"
is true.
Originally, the condition was "(cmp <= 0 && (sorted || cmp == 0))", which
means that once "cmp <= 0" is known to be true, it still has to check
"sorted" and, if that is false, "cmp" again. The second check of "cmp" is
redundant because its value has already been examined.
Looking at the logic, the branch is always taken when "cmp == 0", so
there is no need to test "cmp == 0" twice. We only need to short-circuit
when "cmp == 0" and check "sorted" when "cmp < 0".
The refactor shrinks the generated code for xattr_find_entry() by 44
bytes. With fewer instructions to execute, it should benefit execution
efficiency as well.
$ ./scripts/bloat-o-meter vmlinux_old vmlinux_new
add/remove: 0/0 grow/shrink: 0/1 up/down: 0/-44 (-44)
Function old new delta
xattr_find_entry 300 256 -44
Total: Before=22989434, After=22989390, chg -0.00%
The test is done on kernel version 6.16 with x86_64 defconfig
and gcc 13.3.0.
Signed-off-by: I Hsin Cheng <richard120310@gmail.com>
---
fs/ext4/xattr.c | 2 +-
1 file changed, 1 insertion(+), 1 deletion(-)
diff --git a/fs/ext4/xattr.c b/fs/ext4/xattr.c
index 8d15acbacc20..1993622e3c74 100644
--- a/fs/ext4/xattr.c
+++ b/fs/ext4/xattr.c
@@ -338,7 +338,7 @@ xattr_find_entry(struct inode *inode, struct ext4_xattr_entry **pentry,
cmp = name_len - entry->e_name_len;
if (!cmp)
cmp = memcmp(name, entry->e_name, name_len);
- if (cmp <= 0 && (sorted || cmp == 0))
+ if (!cmp || (cmp < 0 && sorted))
break;
}
*pentry = entry;
--
2.43.0
* Re: [PATCH] ext4: Refactor breaking condition for xattr_find_entry()
2025-07-08 2:00 [PATCH] ext4: Refactor breaking condition for xattr_find_entry() I Hsin Cheng
@ 2025-07-08 2:24 ` Theodore Ts'o
2025-07-08 3:01 ` I Hsin Cheng
2025-07-19 21:45 ` Theodore Ts'o
1 sibling, 1 reply; 4+ messages in thread
From: Theodore Ts'o @ 2025-07-08 2:24 UTC (permalink / raw)
To: I Hsin Cheng
Cc: adilger.kernel, linux-ext4, linux-kernel, skhan,
linux-kernel-mentees
On Tue, Jul 08, 2025 at 10:00:13AM +0800, I Hsin Cheng wrote:
> diff --git a/fs/ext4/xattr.c b/fs/ext4/xattr.c
> index 8d15acbacc20..1993622e3c74 100644
> --- a/fs/ext4/xattr.c
> +++ b/fs/ext4/xattr.c
> @@ -338,7 +338,7 @@ xattr_find_entry(struct inode *inode, struct ext4_xattr_entry **pentry,
> cmp = name_len - entry->e_name_len;
> if (!cmp)
> cmp = memcmp(name, entry->e_name, name_len);
> - if (cmp <= 0 && (sorted || cmp == 0))
> + if (!cmp || (cmp < 0 && sorted))
This is *not* identical. Suppose memcmp returns a positive value
(say, 1). Previously, the conditional would be false. With your
change, !cmp would be true, so the overall conditional would be true.
So this does not appear to be a valid transformation.
(Note that valid transformations will be done by the compiler
automatically, without needing to make code changes.)
- Ted
^ permalink raw reply [flat|nested] 4+ messages in thread
* Re: [PATCH] ext4: Refactor breaking condition for xattr_find_entry()
2025-07-08 2:24 ` Theodore Ts'o
@ 2025-07-08 3:01 ` I Hsin Cheng
0 siblings, 0 replies; 4+ messages in thread
From: I Hsin Cheng @ 2025-07-08 3:01 UTC (permalink / raw)
To: Theodore Ts'o
Cc: adilger.kernel, linux-ext4, linux-kernel, skhan,
linux-kernel-mentees
On Mon, Jul 07, 2025 at 10:24:53PM -0400, Theodore Ts'o wrote:
> On Tue, Jul 08, 2025 at 10:00:13AM +0800, I Hsin Cheng wrote:
> > diff --git a/fs/ext4/xattr.c b/fs/ext4/xattr.c
> > index 8d15acbacc20..1993622e3c74 100644
> > --- a/fs/ext4/xattr.c
> > +++ b/fs/ext4/xattr.c
> > @@ -338,7 +338,7 @@ xattr_find_entry(struct inode *inode, struct ext4_xattr_entry **pentry,
> > cmp = name_len - entry->e_name_len;
> > if (!cmp)
> > cmp = memcmp(name, entry->e_name, name_len);
> > - if (cmp <= 0 && (sorted || cmp == 0))
> > + if (!cmp || (cmp < 0 && sorted))
>
> This is *not* identical. Suppose memcmp returns a positive value
> (say, 1). Previously, the conditional would be false. With your
> change, !cmp would be true, so the overall conditional would be true.
>
> So this does not appear to be a valid transformation.
>
> (Note that valid transformations will be done by the compiler
> automatically, without needing to make code changes.)
>
> - Ted
Hi Ted,
> This is *not* identical. Suppose memcmp returns a positive value
> (say, 1). Previously, the conditional would be false. With your
> change, !cmp would be true, so the overall conditional would be true.
I would argue that "!cmp" is only true when "cmp" is zero; otherwise
it is false, whether the number is positive or negative.
By the distributive law, and because "cmp == 0" implies "cmp <= 0", the
following expressions are equivalent:
* "cmp <= 0 && (sorted || cmp == 0)"
* "(cmp <= 0 && sorted) || (cmp <= 0 && cmp == 0)"
* "(cmp <= 0 && sorted) || (cmp == 0)"
* "(cmp == 0) || (cmp <= 0 && sorted)"
Because "cmp == 0" (which is "!cmp") short-circuits the condition, the
second operand is only evaluated when "cmp" is non-zero, so we can
further simplify "(cmp <= 0 && sorted)" to "(cmp < 0 && sorted)".
For any non-zero value of "cmp", "!cmp" is false, so the condition goes
on to check "(cmp < 0 && sorted)".
This is my derivation; let me know if there's anything wrong with it.
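Written symbolically, the same steps look like this. The shorthand is
introduced here only for brevity: c stands for "cmp" and s for "sorted".
The third line uses the identity A or B = A or (not A and B).

```latex
\begin{align*}
(c \le 0) \land (s \lor c = 0)
  &\iff (c \le 0 \land s) \lor (c \le 0 \land c = 0) \\
  &\iff (c \le 0 \land s) \lor (c = 0) \\
  &\iff (c = 0) \lor (c \ne 0 \land c \le 0 \land s) \\
  &\iff (c = 0) \lor (c < 0 \land s)
\end{align*}
```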
> (Note that valid transformations will be done by the compiler
> automatically, without needing to make code changes.)
Makes sense, thanks for the heads-up, but I think we do get some
benefit from it when compiling at the -O2 optimization level?
As the bloat-o-meter output indicates, the generated code size is
actually reduced.
Best regards,
I Hsin Cheng
^ permalink raw reply [flat|nested] 4+ messages in thread
* Re: [PATCH] ext4: Refactor breaking condition for xattr_find_entry()
2025-07-08 2:00 [PATCH] ext4: Refactor breaking condition for xattr_find_entry() I Hsin Cheng
2025-07-08 2:24 ` Theodore Ts'o
@ 2025-07-19 21:45 ` Theodore Ts'o
1 sibling, 0 replies; 4+ messages in thread
From: Theodore Ts'o @ 2025-07-19 21:45 UTC (permalink / raw)
To: I Hsin Cheng
Cc: Theodore Ts'o, adilger.kernel, linux-ext4, linux-kernel,
skhan, linux-kernel-mentees
On Tue, 08 Jul 2025 10:00:13 +0800, I Hsin Cheng wrote:
> Refactor the condition for breaking the loop within xattr_find_entry().
> Eliminate the usage of "<=" and take condition shortcut when "!cmp" is
> true.
>
> Originally, the condition was "(cmp <= 0 && (sorted || cmp == 0))", which
> means after it knows "cmp <= 0" is true, it has to check the value of
> "sorted" and "cmp". The checking of "cmp" here would be redundant since
> it has already checked it.
>
> [...]
Applied, thanks!
[1/1] ext4: Refactor breaking condition for xattr_find_entry()
commit: 9d9076238fe9fe45257f298bf51b35aa796cf0f1
Best regards,
--
Theodore Ts'o <tytso@mit.edu>