From: David Gibson <david@gibson.dropbear.id.au>
To: Yao Zi <ziyao@disroot.org>
Cc: devicetree-compiler@vger.kernel.org
Subject: Re: [RESEND PATCH] flattree: Optimize stringtable_insert()
Date: Tue, 15 Apr 2025 11:39:03 +1000 [thread overview]
Message-ID: <Z_24t5n2WnmsqXAg@zatzit> (raw)
In-Reply-To: <Z_z7IUsK1zbikIp2@pie>
[-- Attachment #1: Type: text/plain, Size: 2949 bytes --]
On Mon, Apr 14, 2025 at 12:10:09PM +0000, Yao Zi wrote:
> On Mon, Apr 14, 2025 at 04:24:23PM +1000, David Gibson wrote:
> > On Sat, Apr 12, 2025 at 10:03:52AM +0000, Yao Zi wrote:
> > > According to perf result, stringtable_insert() is one of the five hotest
> > > functions, which is obvious since it depends on a brute-force,
> > > quadratic-complexity method to deduplicate the string block and is
> > > called at creation of every property.
> >
> > Right. The optimization strategy of dtc and libfdt is pretty much "if
> > it's slow enough that it bothers someone, then we'll think about it".
> > Little effort has gone into optimization, in general, because device
> > trees are generally so small that runtime hasn't really been a
> > problem, even with very inefficient coding.
> >
> > > This patch optimizes the function in two ways,
> > >
> > > - Replace brute-force deduplication with libc-provided memmem(), which
> > > is guaranteed to be in linear complexity on both glibc on musl libc.
> > > This brings roughly 24.6% reduction in execution time.
> >
> > That said, this is a substantial improvement for a very simple change.
> > I'd be happy to apply this change, separated into its own patch.
> >
> > > - An open-addressing hashtable is maintained to track strings already
> > > inserted. As property names are likely to be duplicated, this could
> > > filter out many existing strings, avoiding traversing the string
> > > block. This reduces another 1.2% of execution time.
> >
> > Honestly the substantial complexity this adds doesn't seem worth it
> > for a mere 1.2% improvement.
>
> Thanks for the feedback. At first I'm only interested in improving the
> deduplication with a hashtable -- but later it's found that reusing
> subsequences is also necessary and memmem() itself surprisingly brings a
> huge improvement.
>
> I'm preparing for v2 where the hashtable is split out and also used for
> optimizing get_node_by_label(). Although the changes are not ready for
> reviewing now, the new optimization reduces executation by another
> 6.4%. Do you think it's a reason strong enough to keep the the code?
Probably not, unless you can convince me there's a use case where the
performance is a pressing practical problem.
> Here's a summary of the changes,
>
> Makefile.dtc | 1 +
> checks.c | 4 --
> dtc-parser.y | 4 +-
> dtc.c | 4 ++
> dtc.h | 19 +++++++
> flattree.c | 68 +++++++++++++++++++------
> hashtable.c | 141 +++++++++++++++++++++++++++++++++++++++++++++++++++
> livetree.c | 35 +++++++++++--
> 8 files changed, 251 insertions(+), 25 deletions(-)
>
> ...
>
>
> Best regards,
> Yao Zi
>
--
David Gibson (he or they) | I'll have my music baroque, and my code
david AT gibson.dropbear.id.au | minimalist, thank you, not the other way
| around.
http://www.ozlabs.org/~dgibson
[-- Attachment #2: signature.asc --]
[-- Type: application/pgp-signature, Size: 833 bytes --]
next prev parent reply other threads:[~2025-04-15 1:39 UTC|newest]
Thread overview: 7+ messages / expand[flat|nested] mbox.gz Atom feed top
2025-04-12 10:03 [RESEND PATCH] flattree: Optimize stringtable_insert() Yao Zi
2025-04-12 12:45 ` Simon Glass
2025-04-14 3:27 ` Yao Zi
2025-04-14 6:24 ` David Gibson
2025-04-14 12:10 ` Yao Zi
2025-04-15 1:39 ` David Gibson [this message]
2025-04-15 4:03 ` Yao Zi
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=Z_24t5n2WnmsqXAg@zatzit \
--to=david@gibson.dropbear.id.au \
--cc=devicetree-compiler@vger.kernel.org \
--cc=ziyao@disroot.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).