From: Jeff King <peff@peff.net>
To: Taylor Blau <me@ttaylorr.com>
Cc: git@vger.kernel.org, Junio C Hamano <gitster@pobox.com>,
Elijah Newren <newren@gmail.com>, Patrick Steinhardt <ps@pks.im>
Subject: Re: [PATCH 05/16] midx: introduce `--checksum-only` for incremental MIDX writes
Date: Mon, 30 Mar 2026 19:15:47 -0400 [thread overview]
Message-ID: <20260330231547.GE41843@coredump.intra.peff.net> (raw)
In-Reply-To: <41cb5471bb96d38a1b9ae75d022e2c5356318939.1774820449.git.me@ttaylorr.com>
On Sun, Mar 29, 2026 at 05:41:03PM -0400, Taylor Blau wrote:
> When writing an incremental MIDX layer, the MIDX machinery writes the
> new layer into the multi-pack-index.d directory and then updates the
> multi-pack-index-chain file to include the freshly written layer.
>
> Future callers however may not wish to immediately update the MIDX chain
> itself, preferring instead to write out new layer(s) themselves before
> atomically updating the chain. Concretely, the new incremental
> MIDX-based repacking strategy will want to do exactly this (that is,
> assemble the new MIDX chain itself before writing a new chain file and
> atomically linking it into place).
>
> Introduce a `--checksum-only` flag that:
>
> * writes the new MIDX layer into the multi-pack-index.d directory
>
> * prints its checksum
>
> * does not update the multi-pack-index-chain file.
Your goal makes sense, but that is not at all what I would have expected
a "--checksum-only" option to do. From the name I expected it to just
compute or verify checksums and not write anything to disk. I dunno.
Maybe it should be "--no-write-chain-file" or something?
This is perhaps on the border of bike-shedding, and maybe the fact that
it is an option of "multi-pack-index write" would be enough to clarify.
> @@ -221,6 +233,9 @@ static int cmd_multi_pack_index_compact(int argc, const char **argv,
> MIDX_WRITE_BITMAP | MIDX_WRITE_REV_INDEX),
> OPT_BIT(0, "incremental", &opts.flags,
> N_("write a new incremental MIDX"), MIDX_WRITE_INCREMENTAL),
> + OPT_BIT(0, "checksum-only", &opts.flags,
> + N_("write a MIDX layer without updating the MIDX chain"),
> + MIDX_WRITE_CHECKSUM_ONLY),
> OPT_END(),
> };
>
> @@ -239,6 +254,15 @@ static int cmd_multi_pack_index_compact(int argc, const char **argv,
> if (argc != 2)
> usage_with_options(builtin_multi_pack_index_compact_usage,
> options);
> +
> + if (opts.flags & MIDX_WRITE_CHECKSUM_ONLY &&
> + !(opts.flags & MIDX_WRITE_INCREMENTAL)) {
> + error(_("cannot use %s without %s"),
> + "--checksum-only", "--incremental");
> + usage_with_options(builtin_multi_pack_index_compact_usage,
> + options);
> + }
> +
OK. It's sad that we have to duplicate these bits between the write and
compact operations, but I'm not sure how much sense it would make to
factor out common options (both how much work it is, and how much
duplication we could actually drop in practice).
> source = handle_object_dir_option(the_repository);
>
> FREE_AND_NULL(options);
> diff --git a/midx-write.c b/midx-write.c
> index 9f7d2bbf4cb..2c6905173ba 100644
> --- a/midx-write.c
> +++ b/midx-write.c
> @@ -1600,11 +1600,14 @@ static int write_midx_internal(struct write_midx_opts *opts)
> }
>
> if (ctx.incremental) {
> - struct strbuf lock_name = STRBUF_INIT;
> + if (!(opts->flags & MIDX_WRITE_CHECKSUM_ONLY)) {
> + struct strbuf lock_name = STRBUF_INIT;
>
> - get_midx_chain_filename(opts->source, &lock_name);
> - hold_lock_file_for_update(&lk, lock_name.buf, LOCK_DIE_ON_ERROR);
> - strbuf_release(&lock_name);
> + get_midx_chain_filename(opts->source, &lock_name);
> + hold_lock_file_for_update(&lk, lock_name.buf,
> + LOCK_DIE_ON_ERROR);
> + strbuf_release(&lock_name);
> + }
OK, we skip the chain-file write here (or least the start of it).
> @@ -1725,14 +1728,19 @@ static int write_midx_internal(struct write_midx_opts *opts)
> }
> strvec_init_alloc(&keep_hashes, keep_hashes_nr);
>
> + if (opts->flags & MIDX_WRITE_CHECKSUM_ONLY)
> + printf("%s\n", hash_to_hex_algop(midx_hash, r->hash_algo));
And then output the hash here instead.
Just thinking on the name I suggested, it implies the action from the
first hunk above, but not this one (whereas your name sort of does the
opposite). I don't think there is any reason to have _two_ options,
since you'd always want to do them together (or not at all). I still
like my suggestion better. ;)
> if (ctx.incremental) {
> - FILE *chainf = fdopen_lock_file(&lk, "w");
> struct strbuf final_midx_name = STRBUF_INIT;
> struct multi_pack_index *m = ctx.base_midx;
>
> - if (!chainf) {
> - error_errno(_("unable to open multi-pack-index chain file"));
> - goto cleanup;
> + if (!(opts->flags & MIDX_WRITE_CHECKSUM_ONLY)) {
> + FILE *chainf = fdopen_lock_file(&lk, "w");
> + if (!chainf) {
> + error_errno(_("unable to open multi-pack-index chain file"));
> + goto cleanup;
> + }
> }
OK, and this is the actual write which is skipped. I guess this could
also be conditional on is_lock_file_locked(&lk), assuming we initialize
it with LOCK_INIT. In some ways that seems easier to analyze to me,
since you don't have to realize that "lk" is only active when
CHECKSUM_ONLY is not set. But I don't think there's another reason we'd
choose not to take a lock file, so I think it may be six of one and half
a dozen of the other.
> @@ -1793,8 +1801,10 @@ static int write_midx_internal(struct write_midx_opts *opts)
> }
> }
>
> - for (uint32_t i = 0; i < keep_hashes_nr; i++)
> - fprintf(get_lock_file_fp(&lk), "%s\n", keep_hashes.v[i]);
> + if (!(opts->flags & MIDX_WRITE_CHECKSUM_ONLY))
> + for (uint32_t i = 0; i < keep_hashes_nr; i++)
> + fprintf(get_lock_file_fp(&lk), "%s\n",
> + keep_hashes.v[i]);
> } else {
> keep_hashes.v[ctx.num_multi_pack_indexes_before] =
> xstrdup(hash_to_hex_algop(midx_hash, r->hash_algo));
> @@ -1804,10 +1814,12 @@ static int write_midx_internal(struct write_midx_opts *opts)
> if (ctx.m || ctx.base_midx)
> odb_close(ctx.repo->objects);
>
> - if (commit_lock_file(&lk) < 0)
> - die_errno(_("could not write multi-pack-index"));
> + if (!(opts->flags & MIDX_WRITE_CHECKSUM_ONLY)) {
> + if (commit_lock_file(&lk) < 0)
> + die_errno(_("could not write multi-pack-index"));
>
> - clear_midx_files(opts->source, &keep_hashes, ctx.incremental);
> + clear_midx_files(opts->source, &keep_hashes, ctx.incremental);
> + }
> result = 0;
Ditto on these conditionals.
I'm out of time to keep reviewing, so I'll stop here for now. I'll try
to continue with the rest late tonight or tomorrow.
-Peff
next prev parent reply other threads:[~2026-03-30 23:15 UTC|newest]
Thread overview: 92+ messages / expand[flat|nested] mbox.gz Atom feed top
2026-03-29 21:40 [PATCH 00/16] repack: incremental MIDX/bitmap-based repacking Taylor Blau
2026-03-29 21:40 ` [PATCH 01/16] midx-write: handle noop writes when converting incremental chains Taylor Blau
2026-03-30 22:33 ` Jeff King
2026-03-31 21:43 ` Taylor Blau
2026-03-29 21:40 ` [PATCH 02/16] midx: use `string_list` for retained MIDX files Taylor Blau
2026-03-30 22:38 ` Jeff King
2026-03-31 21:49 ` Taylor Blau
2026-03-29 21:40 ` [PATCH 03/16] strvec: introduce `strvec_init_alloc()` Taylor Blau
2026-03-30 22:46 ` Jeff King
2026-03-29 21:41 ` [PATCH 04/16] midx: use `strvec` for `keep_hashes` Taylor Blau
2026-03-30 23:01 ` Jeff King
2026-03-31 22:26 ` Taylor Blau
2026-03-31 22:50 ` Taylor Blau
2026-03-31 23:17 ` Jeff King
2026-04-01 15:41 ` Taylor Blau
2026-04-01 19:25 ` Jeff King
2026-03-29 21:41 ` [PATCH 05/16] midx: introduce `--checksum-only` for incremental MIDX writes Taylor Blau
2026-03-30 23:15 ` Jeff King [this message]
2026-04-02 22:51 ` Taylor Blau
2026-03-29 21:41 ` [PATCH 06/16] midx: support custom `--base` " Taylor Blau
2026-04-07 5:57 ` Jeff King
2026-04-14 22:09 ` Taylor Blau
2026-03-29 21:41 ` [PATCH 07/16] repack: track the ODB source via existing_packs Taylor Blau
2026-04-07 6:04 ` Jeff King
2026-04-14 22:24 ` Taylor Blau
2026-03-29 21:41 ` [PATCH 08/16] midx: expose `midx_layer_contains_pack()` Taylor Blau
2026-04-07 6:05 ` Jeff King
2026-03-29 21:41 ` [PATCH 09/16] repack-midx: factor out `repack_prepare_midx_command()` Taylor Blau
2026-03-29 21:41 ` [PATCH 10/16] repack-midx: extract `repack_fill_midx_stdin_packs()` Taylor Blau
2026-04-07 6:08 ` Jeff King
2026-03-29 21:41 ` [PATCH 11/16] repack-geometry: prepare for incremental MIDX repacking Taylor Blau
2026-04-07 6:10 ` Jeff King
2026-04-16 22:51 ` Elijah Newren
2026-04-21 19:34 ` Taylor Blau
2026-03-29 21:41 ` [PATCH 12/16] builtin/repack.c: convert `--write-midx` to an `OPT_CALLBACK` Taylor Blau
2026-04-07 6:18 ` Jeff King
2026-03-29 21:41 ` [PATCH 13/16] packfile: ensure `close_pack_revindex()` frees in-memory revindex Taylor Blau
2026-04-07 6:29 ` Jeff King
2026-03-29 21:41 ` [PATCH 14/16] repack: implement incremental MIDX repacking Taylor Blau
2026-04-16 22:53 ` Elijah Newren
2026-04-21 19:40 ` Taylor Blau
2026-03-29 21:41 ` [PATCH 15/16] repack: introduce `--write-midx=incremental` Taylor Blau
2026-04-16 22:53 ` Elijah Newren
2026-04-21 19:52 ` Taylor Blau
2026-03-29 21:41 ` [PATCH 16/16] repack: allow `--write-midx=incremental` without `--geometric` Taylor Blau
2026-04-14 22:38 ` [PATCH 00/16] repack: incremental MIDX/bitmap-based repacking Taylor Blau
2026-04-21 20:37 ` [PATCH v2 " Taylor Blau
2026-04-21 20:37 ` [PATCH v2 01/16] midx-write: handle noop writes when converting incremental chains Taylor Blau
2026-04-21 20:37 ` [PATCH v2 02/16] midx: use `strset` for retained MIDX files Taylor Blau
2026-04-21 20:37 ` [PATCH v2 03/16] midx: build `keep_hashes` array in order Taylor Blau
2026-04-21 20:37 ` [PATCH v2 04/16] midx: use `strvec` for `keep_hashes` Taylor Blau
2026-04-21 20:37 ` [PATCH v2 05/16] midx: introduce `--no-write-chain-file` for incremental MIDX writes Taylor Blau
2026-04-21 20:37 ` [PATCH v2 06/16] midx: support custom `--base` " Taylor Blau
2026-04-21 20:37 ` [PATCH v2 07/16] repack: track the ODB source via existing_packs Taylor Blau
2026-04-21 20:37 ` [PATCH v2 08/16] midx: expose `midx_layer_contains_pack()` Taylor Blau
2026-04-21 20:37 ` [PATCH v2 09/16] repack-midx: factor out `repack_prepare_midx_command()` Taylor Blau
2026-04-21 20:37 ` [PATCH v2 10/16] repack-midx: extract `repack_fill_midx_stdin_packs()` Taylor Blau
2026-04-29 8:08 ` Jeff King
2026-04-29 22:40 ` Taylor Blau
2026-04-21 20:37 ` [PATCH v2 11/16] repack-geometry: prepare for incremental MIDX repacking Taylor Blau
2026-04-21 20:37 ` [PATCH v2 12/16] builtin/repack.c: convert `--write-midx` to an `OPT_CALLBACK` Taylor Blau
2026-04-21 20:37 ` [PATCH v2 13/16] packfile: ensure `close_pack_revindex()` frees in-memory revindex Taylor Blau
2026-04-21 20:37 ` [PATCH v2 14/16] repack: implement incremental MIDX repacking Taylor Blau
2026-04-29 7:51 ` Jeff King
2026-04-29 23:36 ` Taylor Blau
2026-04-29 8:10 ` Jeff King
2026-04-29 23:39 ` Taylor Blau
2026-04-21 20:37 ` [PATCH v2 15/16] repack: introduce `--write-midx=incremental` Taylor Blau
2026-04-21 21:02 ` Taylor Blau
2026-04-21 20:38 ` [PATCH v2 16/16] repack: allow `--write-midx=incremental` without `--geometric` Taylor Blau
2026-04-22 14:45 ` [PATCH v2 00/16] repack: incremental MIDX/bitmap-based repacking Elijah Newren
2026-04-29 8:10 ` Jeff King
2026-04-30 0:13 ` [PATCH v3 " Taylor Blau
2026-04-30 0:13 ` [PATCH v3 01/16] midx-write: handle noop writes when converting incremental chains Taylor Blau
2026-04-30 0:13 ` [PATCH v3 02/16] midx: use `strset` for retained MIDX files Taylor Blau
2026-04-30 0:13 ` [PATCH v3 03/16] midx: build `keep_hashes` array in order Taylor Blau
2026-04-30 0:13 ` [PATCH v3 04/16] midx: use `strvec` for `keep_hashes` Taylor Blau
2026-04-30 0:13 ` [PATCH v3 05/16] midx: introduce `--no-write-chain-file` for incremental MIDX writes Taylor Blau
2026-04-30 0:13 ` [PATCH v3 06/16] midx: support custom `--base` " Taylor Blau
2026-04-30 0:13 ` [PATCH v3 07/16] repack: track the ODB source via existing_packs Taylor Blau
2026-04-30 0:13 ` [PATCH v3 08/16] midx: expose `midx_layer_contains_pack()` Taylor Blau
2026-04-30 0:13 ` [PATCH v3 09/16] repack-midx: factor out `repack_prepare_midx_command()` Taylor Blau
2026-05-13 21:45 ` SZEDER Gábor
2026-04-30 0:13 ` [PATCH v3 10/16] repack-midx: extract `repack_fill_midx_stdin_packs()` Taylor Blau
2026-04-30 0:13 ` [PATCH v3 11/16] repack-geometry: prepare for incremental MIDX repacking Taylor Blau
2026-04-30 0:13 ` [PATCH v3 12/16] builtin/repack.c: convert `--write-midx` to an `OPT_CALLBACK` Taylor Blau
2026-04-30 0:13 ` [PATCH v3 13/16] packfile: ensure `close_pack_revindex()` frees in-memory revindex Taylor Blau
2026-04-30 0:13 ` [PATCH v3 14/16] repack: implement incremental MIDX repacking Taylor Blau
2026-04-30 0:13 ` [PATCH v3 15/16] repack: introduce `--write-midx=incremental` Taylor Blau
2026-05-13 23:08 ` Jeff King
2026-04-30 0:13 ` [PATCH v3 16/16] repack: allow `--write-midx=incremental` without `--geometric` Taylor Blau
2026-05-01 6:46 ` [PATCH v3 00/16] repack: incremental MIDX/bitmap-based repacking Jeff King
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20260330231547.GE41843@coredump.intra.peff.net \
--to=peff@peff.net \
--cc=git@vger.kernel.org \
--cc=gitster@pobox.com \
--cc=me@ttaylorr.com \
--cc=newren@gmail.com \
--cc=ps@pks.im \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox