From: Jeff King <peff@peff.net>
To: Junio C Hamano <gitster@pobox.com>
Cc: git@vger.kernel.org
Subject: Re: [RFC] cocci: .buf in a strbuf object can never be NULL
Date: Fri, 20 Mar 2026 02:18:52 -0400 [thread overview]
Message-ID: <20260320061852.GA35538@coredump.intra.peff.net> (raw)
In-Reply-To: <20260320055709.GA35291@coredump.intra.peff.net>
On Fri, Mar 20, 2026 at 01:57:09AM -0400, Jeff King wrote:
> I haven't measured to see what the exact cost is, but I know that
> looping over a strbuf (with a reset in the loop, or the implied reset
> from a getline call) is a common optimization trick that does have a
> measurable improvement for some cases.
Try something like this:
# input file is all objects in linux.git
cd linux.git
git repack -ad
git show-index <.git/objects/pack/pack-*.idx | cut -d' ' -f2 >input
# now do something that primarily reads a bunch of lines
git cat-file --buffer --batch-check='%(objectname)" <input
That's a somewhat silly command, though it does do something useful (it
checks that each object exists). Here is master ("git.old") versus
applying your patch ("git.new"):
Benchmark 1: ./git.old cat-file --buffer --batch-check="%(objectname)" <input
Time (mean ± σ): 3.152 s ± 0.057 s [User: 3.067 s, System: 0.085 s]
Range (min … max): 3.048 s … 3.225 s 10 runs
Benchmark 2: ./git.new cat-file --buffer --batch-check="%(objectname)" <input
Time (mean ± σ): 3.377 s ± 0.065 s [User: 3.295 s, System: 0.082 s]
Range (min … max): 3.279 s … 3.480 s 10 runs
Summary
./git.old cat-file --buffer --batch-check="%(objectname)" <input ran
1.07 ± 0.03 times faster than ./git.new cat-file --buffer --batch-check="%(objectname)" <input
That's a fairly extreme example, but I think shows that the extra
allocations do have measurable overhead. If you ask it do more work
(asking for %(objecttype) or something) the relative change becomes
smaller, but the absolute slowdown (a few hundred ms) remains.
-Peff
next prev parent reply other threads:[~2026-03-20 6:18 UTC|newest]
Thread overview: 20+ messages / expand[flat|nested] mbox.gz Atom feed top
2026-03-19 7:15 [PATCH] rerere: update to modern representation of empty strbufs Junio C Hamano
2026-03-19 7:57 ` Patrick Steinhardt
2026-03-19 22:14 ` [RFC] cocci: .buf in a strbuf object can never be NULL Junio C Hamano
2026-03-19 23:35 ` Jeff King
2026-03-20 1:46 ` Junio C Hamano
2026-03-20 4:18 ` Jeff King
2026-03-20 5:45 ` Junio C Hamano
2026-03-20 5:57 ` Jeff King
2026-03-20 6:06 ` Junio C Hamano
2026-03-20 6:18 ` Jeff King [this message]
2026-03-21 13:14 ` René Scharfe
2026-03-21 16:41 ` Jeff King
2026-03-21 20:47 ` René Scharfe
2026-03-21 21:18 ` Jeff King
2026-03-21 23:41 ` René Scharfe
2026-03-22 1:44 ` Jeff King
2026-03-22 1:22 ` Junio C Hamano
2026-03-22 1:40 ` Jeff King
2026-03-21 16:24 ` Junio C Hamano
2026-03-21 16:39 ` Jeff King
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20260320061852.GA35538@coredump.intra.peff.net \
--to=peff@peff.net \
--cc=git@vger.kernel.org \
--cc=gitster@pobox.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox