From: Patrick Steinhardt <ps@pks.im>
To: Taylor Blau <me@ttaylorr.com>
Cc: git@vger.kernel.org, Karthik Nayak <karthik.188@gmail.com>,
Jeff King <peff@peff.net>
Subject: Re: [PATCH v2 02/16] odb: move list of packfiles into `struct packfile_store`
Date: Tue, 2 Sep 2025 10:50:41 +0200 [thread overview]
Message-ID: <aLav4UAcfQjvNzMF@pks.im> (raw)
In-Reply-To: <aKz0/WNu/GRYh3/W@nand.local>
On Mon, Aug 25, 2025 at 07:42:53PM -0400, Taylor Blau wrote:
> On Thu, Aug 21, 2025 at 09:39:00AM +0200, Patrick Steinhardt wrote:
> > The object database tracks the list of packfiles it currently knows
> > about. With the introduction of the `struct packfile_store` we have a
> > better place to host this list though.
> >
> > Move the list accordingly. Extract the logic from `odb_clear()` that
> > knows to close all such packfiles and move it into the new subsystem, as
> > well.
>
> Not a comment on this patch itself, but as a meta-comment, I really
> appreciate you taking such an incremental approach here. The packfile
> machinery is quite fragile in my experience, so breaking it up into (what
> are so far) easily review-able chunks makes it much easier to build
> confidence in the correctness of these changes.
It certainly is fragile overall. I stared at code for way longer than I
really want to admit in some cases.
> > diff --git a/odb.c b/odb.c
> > index 34b70d0074..17a9135cbd 100644
> > --- a/odb.c
> > +++ b/odb.c
> > @@ -1038,16 +1038,7 @@ void odb_clear(struct object_database *o)
> >
> > INIT_LIST_HEAD(&o->packed_git_mru);
> > close_object_store(o);
> > -
> > - /*
> > - * `close_object_store()` only closes the packfiles, but doesn't free
> > - * them. We thus have to do this manually.
> > - */
> > - for (struct packed_git *p = o->packed_git, *next; p; p = next) {
> > - next = p->next;
> > - free(p);
> > - }
> > - o->packed_git = NULL;
> > + packfile_store_free(o->packfiles);
>
> Interesting. The movement of the for-loop here all looks correct to me.
> But I think the new packfile_store is creating a new awkardness here
> that we should consider.
>
> In existing implementation, all of the ->next pointers here point to
> heap locations that have already been free()'d. But that's OK, since
> they aren't reachable at the moment that we do "o-packed_store = NULL".
>
> Having a separate packfile_store changes that, since (from my reading of
> the code) o->packfiles will still be non-NULL even after calling
> odb_clear(), *and* those pointers will refer to free'd heap locations.
>
> That seems like a potential footgun to me. I think that we could either:
>
> * Change packfile_store_free() to take in an object_database pointer,
> and NULL out the ->packs pointer after free'ing all of the packfiles.
> That would make it more similar to the existing behavior.
>
> * Leave packfile_store_free() as-is, document that it does NOT clear
> out the top-level pointer, and so callers are encouraged to NULL it
> out themselves after calling it. Likewise, we should change
> odb_clear() to do:
>
> packfile_store_free(o->packfiles);
> o->packfiles = NULL;
>
> Let me know what you think.
Good point. I think it's unlikely to ever become a problem, but I don't
see a reason why we shouldn't NULL out `o->packfiles`, either. So I'll
do the second approach.
> > diff --git a/packfile.c b/packfile.c
> > index 8fbf1cfc2d..6478e4cc30 100644
> > --- a/packfile.c
> > +++ b/packfile.c
> > @@ -278,7 +278,7 @@ static int unuse_one_window(struct packed_git *current)
> >
> > if (current)
> > scan_windows(current, &lru_p, &lru_w, &lru_l);
> > - for (p = current->repo->objects->packed_git; p; p = p->next)
> > + for (p = current->repo->objects->packfiles->packs; p; p = p->next)
>
> Not a huge deal, but I do find "current->repo->objects->packfiles->packs"
> to be a bit unfortunate. I wonder if we should rename "packs" to "head"
> or "list_head" or similar since it's clear from
> "current->repo->objects->packfiles" that this is a list of packfiles.
I'd like to keep this part as-is for now if you don't mind. This is
mostly because I've got a follow-up patch series that _does_ introduce
`head` as part of making the `->next` pointer go away.
> > @@ -2344,5 +2339,23 @@ struct packfile_store *packfile_store_new(struct object_database *odb)
> >
> > void packfile_store_free(struct packfile_store *store)
> > {
> > + packfile_store_close(store);
>
> Seeing a call to packfile_store_close() here was a little surprising to
> me. The code that you are moving has a comment that says:
>
> * `close_object_store()` only closes the packfiles, but doesn't free
> * them. We thus have to do this manually.
>
> , so I would have expected to preserve that behavior.
This behaviour is preserved though. Calling `packfile_store_close()`
does not free the packfiles, it only closes them. And we continue to
call `packfile_store_close()` in `close_object_store()`, so nothing
changes.
The only change in behaviour is that we now also know to close packfiles
when freeing the packfile store.
> I *think* that
> this happens to be OK, since close_pack() is a noop if it is called more
> than once (though I had to double check through all of its leaf
> functions that that was indeed the case).
>
> I would probably strike this from the new function, since the sole
> caller above already calls close_object_store() before calling
> packfile_store_free().
Calling `packfile_store_close()` is idempotent indeed, so it shouldn't
be an issue to call this function twice. To me the question is whether
there's ever a use case where you want to free the packfile store, but
don't want to close the packfiles stored in it.
From all I've seen that is never the case, so I think it's sensible to
ensure that we always close the packfile store before we free it to make
things a tiny bit more self-contained.
> > +void packfile_store_close(struct packfile_store *store)
> > +{
> > + struct packed_git *p;
> > +
> > + for (p = store->packs; p; p = p->next)
> > + if (p->do_not_close)
> > + BUG("want to close pack marked 'do-not-close'");
> > + else
> > + close_pack(p);
> > +}
>
> And likewise this looks good to me. I do find the braceless for-loop a
> little hard to read, but it's (a) correct, and (b) consistent with the
> original implementation, so I don't feel strongly about changing it.
Agreed, it is a bit awkward. I feel like our coding style should be
amended to say that we only do braceless bodies in case the body is a
single statement.
> As a side-note, you could inline the declaration of "p" here into the
> for-loop, but I can understand not wanting to to make the diff more
> readable with --color-moved.
I wouldn't mind adapting this while at it, too.
Patrick
next prev parent reply other threads:[~2025-09-02 8:50 UTC|newest]
Thread overview: 181+ messages / expand[flat|nested] mbox.gz Atom feed top
2025-08-19 8:19 [PATCH 00/16] packfile: carve out a new packfile store Patrick Steinhardt
2025-08-19 8:19 ` [PATCH 01/16] packfile: introduce a new `struct packfile_store` Patrick Steinhardt
2025-08-19 9:47 ` Karthik Nayak
2025-08-20 4:58 ` Patrick Steinhardt
2025-08-19 17:32 ` Junio C Hamano
2025-08-20 4:58 ` Patrick Steinhardt
2025-08-19 8:19 ` [PATCH 02/16] odb: move list of packfiles into " Patrick Steinhardt
2025-08-19 8:19 ` [PATCH 03/16] odb: move initialization bit " Patrick Steinhardt
2025-08-19 9:57 ` Karthik Nayak
2025-08-19 16:24 ` Junio C Hamano
2025-08-20 8:04 ` Karthik Nayak
2025-08-22 23:50 ` Junio C Hamano
2025-08-26 12:19 ` [PATCH] Documentation: note styling for bit fields Karthik Nayak
2025-08-20 4:58 ` [PATCH 03/16] odb: move initialization bit into `struct packfile_store` Patrick Steinhardt
2025-08-20 6:24 ` Junio C Hamano
2025-08-19 8:19 ` [PATCH 04/16] odb: move packfile map " Patrick Steinhardt
2025-08-19 8:19 ` [PATCH 05/16] odb: move MRU list of packfiles " Patrick Steinhardt
2025-08-20 12:44 ` Karthik Nayak
2025-08-20 19:20 ` Jeff King
2025-08-21 6:40 ` Patrick Steinhardt
2025-08-19 8:19 ` [PATCH 06/16] odb: move kept cache " Patrick Steinhardt
2025-08-19 18:56 ` Junio C Hamano
2025-08-20 4:58 ` Patrick Steinhardt
2025-08-19 8:19 ` [PATCH 07/16] packfile: reorder functions to avoid function declaration Patrick Steinhardt
2025-08-19 19:18 ` Junio C Hamano
2025-08-19 8:19 ` [PATCH 08/16] packfile: refactor `prepare_packed_git()` to work on packfile store Patrick Steinhardt
2025-08-19 8:19 ` [PATCH 09/16] packfile: split up responsibilities of `reprepare_packed_git()` Patrick Steinhardt
2025-08-20 13:17 ` Karthik Nayak
2025-08-19 8:19 ` [PATCH 10/16] packfile: refactor `install_packed_git()` to work on packfile store Patrick Steinhardt
2025-08-19 8:19 ` [PATCH 11/16] packfile: always add packfiles to MRU when adding a pack Patrick Steinhardt
2025-08-20 13:35 ` Karthik Nayak
2025-08-19 8:19 ` [PATCH 12/16] packfile: introduce function to load and add packfiles Patrick Steinhardt
2025-08-20 13:41 ` Karthik Nayak
2025-08-21 6:40 ` Patrick Steinhardt
2025-08-19 8:19 ` [PATCH 13/16] packfile: move `get_multi_pack_index()` into "midx.c" Patrick Steinhardt
2025-08-19 8:19 ` [PATCH 14/16] packfile: remove `get_packed_git()` Patrick Steinhardt
2025-08-20 13:50 ` Karthik Nayak
2025-08-21 6:40 ` Patrick Steinhardt
2025-08-20 13:51 ` Karthik Nayak
2025-08-19 8:19 ` [PATCH 15/16] packfile: refactor `get_all_packs()` to work on packfile store Patrick Steinhardt
2025-08-20 13:53 ` Karthik Nayak
2025-08-21 6:40 ` Patrick Steinhardt
2025-08-19 8:19 ` [PATCH 16/16] packfile: refactor `get_packed_git_mru()` " Patrick Steinhardt
2025-08-19 17:13 ` [PATCH 00/16] packfile: carve out a new " Junio C Hamano
2025-08-20 13:55 ` Karthik Nayak
2025-08-21 7:38 ` [PATCH v2 " Patrick Steinhardt
2025-08-21 7:38 ` [PATCH v2 01/16] packfile: introduce a new `struct packfile_store` Patrick Steinhardt
2025-08-21 7:39 ` [PATCH v2 02/16] odb: move list of packfiles into " Patrick Steinhardt
2025-08-25 23:42 ` Taylor Blau
2025-09-02 8:50 ` Patrick Steinhardt [this message]
2025-09-02 17:21 ` Taylor Blau
2025-09-02 17:42 ` Junio C Hamano
2025-09-03 5:58 ` Patrick Steinhardt
2025-09-11 23:16 ` Taylor Blau
2025-09-15 7:44 ` Patrick Steinhardt
2025-08-21 7:39 ` [PATCH v2 03/16] odb: move initialization bit " Patrick Steinhardt
2025-08-26 1:40 ` Taylor Blau
2025-08-21 7:39 ` [PATCH v2 04/16] odb: move packfile map " Patrick Steinhardt
2025-08-26 1:41 ` Taylor Blau
2025-08-21 7:39 ` [PATCH v2 05/16] odb: move MRU list of packfiles " Patrick Steinhardt
2025-08-21 7:39 ` [PATCH v2 06/16] odb: move kept cache " Patrick Steinhardt
2025-08-26 1:46 ` Taylor Blau
2025-09-02 8:50 ` Patrick Steinhardt
2025-08-21 7:39 ` [PATCH v2 07/16] packfile: reorder functions to avoid function declaration Patrick Steinhardt
2025-08-26 1:47 ` Taylor Blau
2025-08-21 7:39 ` [PATCH v2 08/16] packfile: refactor `prepare_packed_git()` to work on packfile store Patrick Steinhardt
2025-08-26 1:58 ` Taylor Blau
2025-08-21 7:39 ` [PATCH v2 09/16] packfile: split up responsibilities of `reprepare_packed_git()` Patrick Steinhardt
2025-08-26 2:10 ` Taylor Blau
2025-09-02 8:50 ` Patrick Steinhardt
2025-08-21 7:39 ` [PATCH v2 10/16] packfile: refactor `install_packed_git()` to work on packfile store Patrick Steinhardt
2025-08-26 2:11 ` Taylor Blau
2025-09-02 8:50 ` Patrick Steinhardt
2025-08-21 7:39 ` [PATCH v2 11/16] packfile: always add packfiles to MRU when adding a pack Patrick Steinhardt
2025-08-27 1:04 ` Taylor Blau
2025-09-02 8:50 ` Patrick Steinhardt
2025-08-21 7:39 ` [PATCH v2 12/16] packfile: introduce function to load and add packfiles Patrick Steinhardt
2025-08-27 1:12 ` Taylor Blau
2025-08-21 7:39 ` [PATCH v2 13/16] packfile: move `get_multi_pack_index()` into "midx.c" Patrick Steinhardt
2025-08-27 1:20 ` Taylor Blau
2025-08-21 7:39 ` [PATCH v2 14/16] packfile: remove `get_packed_git()` Patrick Steinhardt
2025-08-27 1:38 ` Taylor Blau
2025-09-02 8:50 ` Patrick Steinhardt
2025-09-11 23:25 ` Taylor Blau
2025-09-15 7:30 ` Patrick Steinhardt
2025-08-21 7:39 ` [PATCH v2 15/16] packfile: refactor `get_all_packs()` to work on packfile store Patrick Steinhardt
2025-08-27 1:45 ` Taylor Blau
2025-09-02 8:51 ` Patrick Steinhardt
2025-09-11 23:33 ` Taylor Blau
2025-09-15 7:44 ` Patrick Steinhardt
2025-08-21 7:39 ` [PATCH v2 16/16] packfile: refactor `get_packed_git_mru()` " Patrick Steinhardt
2025-09-02 10:48 ` [PATCH v3 00/15] packfile: carve out a new " Patrick Steinhardt
2025-09-02 10:48 ` [PATCH v3 01/15] packfile: introduce a new `struct packfile_store` Patrick Steinhardt
2025-09-09 7:49 ` Karthik Nayak
2025-09-02 10:48 ` [PATCH v3 02/15] odb: move list of packfiles into " Patrick Steinhardt
2025-09-09 8:00 ` Karthik Nayak
2025-09-09 11:09 ` Patrick Steinhardt
2025-09-02 10:48 ` [PATCH v3 03/15] odb: move initialization bit " Patrick Steinhardt
2025-09-02 10:48 ` [PATCH v3 04/15] odb: move packfile map " Patrick Steinhardt
2025-09-09 8:22 ` Karthik Nayak
2025-09-09 11:01 ` Patrick Steinhardt
2025-09-02 10:48 ` [PATCH v3 05/15] odb: move MRU list of packfiles " Patrick Steinhardt
2025-09-02 10:48 ` [PATCH v3 06/15] odb: move kept cache " Patrick Steinhardt
2025-09-02 10:48 ` [PATCH v3 07/15] packfile: reorder functions to avoid function declaration Patrick Steinhardt
2025-09-02 10:48 ` [PATCH v3 08/15] packfile: refactor `prepare_packed_git()` to work on packfile store Patrick Steinhardt
2025-09-02 10:48 ` [PATCH v3 09/15] packfile: split up responsibilities of `reprepare_packed_git()` Patrick Steinhardt
2025-09-02 10:48 ` [PATCH v3 10/15] packfile: refactor `install_packed_git()` to work on packfile store Patrick Steinhardt
2025-09-02 10:48 ` [PATCH v3 11/15] packfile: introduce function to load and add packfiles Patrick Steinhardt
2025-09-02 10:48 ` [PATCH v3 12/15] packfile: move `get_multi_pack_index()` into "midx.c" Patrick Steinhardt
2025-09-02 10:48 ` [PATCH v3 13/15] packfile: remove `get_packed_git()` Patrick Steinhardt
2025-09-02 10:48 ` [PATCH v3 14/15] packfile: refactor `get_all_packs()` to work on packfile store Patrick Steinhardt
2025-09-02 10:48 ` [PATCH v3 15/15] packfile: refactor `get_packed_git_mru()` " Patrick Steinhardt
2025-09-02 16:40 ` [PATCH v3 00/15] packfile: carve out a new " Junio C Hamano
2025-09-11 23:34 ` Taylor Blau
2025-09-09 9:33 ` Karthik Nayak
2025-09-09 11:02 ` [PATCH v4 " Patrick Steinhardt
2025-09-09 11:03 ` [PATCH v4 01/15] packfile: introduce a new `struct packfile_store` Patrick Steinhardt
2025-09-09 11:03 ` [PATCH v4 02/15] odb: move list of packfiles into " Patrick Steinhardt
2025-09-09 11:03 ` [PATCH v4 03/15] odb: move initialization bit " Patrick Steinhardt
2025-09-09 11:03 ` [PATCH v4 04/15] odb: move packfile map " Patrick Steinhardt
2025-09-09 11:03 ` [PATCH v4 05/15] odb: move MRU list of packfiles " Patrick Steinhardt
2025-09-09 11:03 ` [PATCH v4 06/15] odb: move kept cache " Patrick Steinhardt
2025-09-09 11:03 ` [PATCH v4 07/15] packfile: reorder functions to avoid function declaration Patrick Steinhardt
2025-09-09 11:03 ` [PATCH v4 08/15] packfile: refactor `prepare_packed_git()` to work on packfile store Patrick Steinhardt
2025-09-09 11:03 ` [PATCH v4 09/15] packfile: split up responsibilities of `reprepare_packed_git()` Patrick Steinhardt
2025-09-09 11:03 ` [PATCH v4 10/15] packfile: refactor `install_packed_git()` to work on packfile store Patrick Steinhardt
2025-09-09 11:03 ` [PATCH v4 11/15] packfile: introduce function to load and add packfiles Patrick Steinhardt
2025-09-09 11:03 ` [PATCH v4 12/15] packfile: move `get_multi_pack_index()` into "midx.c" Patrick Steinhardt
2025-09-09 11:03 ` [PATCH v4 13/15] packfile: remove `get_packed_git()` Patrick Steinhardt
2025-09-11 23:37 ` Taylor Blau
2025-09-09 11:03 ` [PATCH v4 14/15] packfile: refactor `get_all_packs()` to work on packfile store Patrick Steinhardt
2025-09-09 11:03 ` [PATCH v4 15/15] packfile: refactor `get_packed_git_mru()` " Patrick Steinhardt
2025-09-10 7:35 ` [PATCH v4 00/15] packfile: carve out a new " Karthik Nayak
2025-09-11 23:40 ` Taylor Blau
2025-09-11 23:42 ` Taylor Blau
2025-09-15 7:25 ` Patrick Steinhardt
2025-09-15 8:54 ` [PATCH v5 " Patrick Steinhardt
2025-09-15 8:54 ` [PATCH v5 01/15] packfile: introduce a new `struct packfile_store` Patrick Steinhardt
2025-09-17 21:26 ` Justin Tobler
2025-09-23 9:34 ` Patrick Steinhardt
2025-09-24 21:56 ` Justin Tobler
2025-09-15 8:54 ` [PATCH v5 02/15] odb: move list of packfiles into " Patrick Steinhardt
2025-09-15 8:54 ` [PATCH v5 03/15] odb: move initialization bit " Patrick Steinhardt
2025-09-15 8:54 ` [PATCH v5 04/15] odb: move packfile map " Patrick Steinhardt
2025-09-17 22:15 ` Justin Tobler
2025-09-23 9:35 ` Patrick Steinhardt
2025-09-15 8:54 ` [PATCH v5 05/15] odb: move MRU list of packfiles " Patrick Steinhardt
2025-09-17 21:59 ` Justin Tobler
2025-09-15 8:54 ` [PATCH v5 06/15] odb: move kept cache " Patrick Steinhardt
2025-09-15 8:54 ` [PATCH v5 07/15] packfile: reorder functions to avoid function declaration Patrick Steinhardt
2025-09-15 8:54 ` [PATCH v5 08/15] packfile: refactor `prepare_packed_git()` to work on packfile store Patrick Steinhardt
2025-09-15 8:54 ` [PATCH v5 09/15] packfile: split up responsibilities of `reprepare_packed_git()` Patrick Steinhardt
2025-09-17 22:32 ` Justin Tobler
2025-09-23 9:34 ` Patrick Steinhardt
2025-09-15 8:54 ` [PATCH v5 10/15] packfile: refactor `install_packed_git()` to work on packfile store Patrick Steinhardt
2025-09-15 8:54 ` [PATCH v5 11/15] packfile: introduce function to load and add packfiles Patrick Steinhardt
2025-09-15 8:54 ` [PATCH v5 12/15] packfile: move `get_multi_pack_index()` into "midx.c" Patrick Steinhardt
2025-09-15 8:54 ` [PATCH v5 13/15] packfile: refactor `get_packed_git()` to work on packfile store Patrick Steinhardt
2025-09-15 8:54 ` [PATCH v5 14/15] packfile: refactor `get_all_packs()` " Patrick Steinhardt
2025-09-15 8:54 ` [PATCH v5 15/15] packfile: refactor `get_packed_git_mru()` " Patrick Steinhardt
2025-09-23 10:16 ` [PATCH v6 00/15] packfile: carve out a new " Patrick Steinhardt
2025-09-23 10:17 ` [PATCH v6 01/15] packfile: introduce a new `struct packfile_store` Patrick Steinhardt
2025-09-23 10:17 ` [PATCH v6 02/15] odb: move list of packfiles into " Patrick Steinhardt
2025-09-23 10:17 ` [PATCH v6 03/15] odb: move initialization bit " Patrick Steinhardt
2025-09-23 10:17 ` [PATCH v6 04/15] odb: move packfile map " Patrick Steinhardt
2025-09-23 10:17 ` [PATCH v6 05/15] odb: move MRU list of packfiles " Patrick Steinhardt
2025-09-23 10:17 ` [PATCH v6 06/15] odb: move kept cache " Patrick Steinhardt
2025-09-23 10:17 ` [PATCH v6 07/15] packfile: reorder functions to avoid function declaration Patrick Steinhardt
2025-09-23 10:17 ` [PATCH v6 08/15] packfile: refactor `prepare_packed_git()` to work on packfile store Patrick Steinhardt
2025-09-23 10:17 ` [PATCH v6 09/15] packfile: split up responsibilities of `reprepare_packed_git()` Patrick Steinhardt
2025-09-23 10:17 ` [PATCH v6 10/15] packfile: refactor `install_packed_git()` to work on packfile store Patrick Steinhardt
2025-09-23 10:17 ` [PATCH v6 11/15] packfile: introduce function to load and add packfiles Patrick Steinhardt
2025-09-23 10:17 ` [PATCH v6 12/15] packfile: move `get_multi_pack_index()` into "midx.c" Patrick Steinhardt
2025-09-23 10:17 ` [PATCH v6 13/15] packfile: refactor `get_packed_git()` to work on packfile store Patrick Steinhardt
2025-09-23 10:17 ` [PATCH v6 14/15] packfile: refactor `get_all_packs()` " Patrick Steinhardt
2025-09-23 10:17 ` [PATCH v6 15/15] packfile: refactor `get_packed_git_mru()` " Patrick Steinhardt
2025-09-24 21:58 ` [PATCH v6 00/15] packfile: carve out a new " Justin Tobler
2025-09-25 16:08 ` Junio C Hamano
2025-09-26 5:26 ` Patrick Steinhardt
2025-09-28 22:05 ` Taylor Blau
2025-09-29 21:39 ` Patrick Steinhardt
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=aLav4UAcfQjvNzMF@pks.im \
--to=ps@pks.im \
--cc=git@vger.kernel.org \
--cc=karthik.188@gmail.com \
--cc=me@ttaylorr.com \
--cc=peff@peff.net \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).