From: Richard Purdie <richard.purdie@linuxfoundation.org>
To: Michael Siebold <michael.siebold@gmail.com>,
Yoann Congal <yoann.congal@smile.fr>
Cc: bitbake-devel@lists.openembedded.org,
Philip Lorenz <philip.lorenz@bmw.de>
Subject: Re: [PATCH 3/3] bitbake: fetch2: Fix LFS object checkout in submodules
Date: Mon, 09 Mar 2026 21:32:58 +0000 [thread overview]
Message-ID: <69ad2307918818d02c42eba53787f404f8fc0fc6.camel@linuxfoundation.org> (raw)
In-Reply-To: <20260309212125.3172717-4-michael.siebold@gmail.com>
On Mon, 2026-03-09 at 14:21 -0700, Michael Siebold wrote:
> From: Philip Lorenz <philip.lorenz@bmw.de>
>
> Skipping smudging prevents the LFS objects from replacing their
> placeholder files when `git submodule update` actually checks out the
> target revision in the submodule. Smudging cannot happen earlier as the
> clone stored in `.git/modules` is bare.
>
> This should be fine as long as all LFS objects are available in the
> download cache (which they are after the other fixes are applied).
>
> (Bitbake rev: d270e33a07c50bb9c08861cf9a6dc51e1fd2d874)
>
> Upstream-Status: Backport [from commit 3eeac69385]
>
> Signed-off-by: Philip Lorenz <philip.lorenz@bmw.de>
> Signed-off-by: Richard Purdie <richard.purdie@linuxfoundation.org>
> (cherry picked from commit 3eeac69385e8f29a08d022a17b28b5d504deed66)
> Signed-off-by: Michael Siebold <michael.siebold@gmail.com>
> ---
> bitbake/lib/bb/fetch2/gitsm.py | 11 +++++------
> 1 file changed, 5 insertions(+), 6 deletions(-)
>
> diff --git a/bitbake/lib/bb/fetch2/gitsm.py b/bitbake/lib/bb/fetch2/gitsm.py
> index 5c98991480..ef19053330 100644
> --- a/bitbake/lib/bb/fetch2/gitsm.py
> +++ b/bitbake/lib/bb/fetch2/gitsm.py
> @@ -243,12 +243,11 @@ class GitSM(Git):
> ret = self.process_submodules(ud, ud.destdir, unpack_submodules, d)
>
> if not ud.bareclone and ret:
> - # All submodules should already be downloaded and configured in the tree. This simply
> - # sets up the configuration and checks out the files. The main project config should
> - # remain unmodified, and no download from the internet should occur. As such, lfs smudge
> - # should also be skipped as these files were already smudged in the fetch stage if lfs
> - # was enabled.
> - runfetchcmd("GIT_LFS_SKIP_SMUDGE=1 %s submodule update --recursive --no-fetch" % (ud.basecmd), d, quiet=True, workdir=ud.destdir)
> + cmdprefix = ""
> + # Avoid LFS smudging (replacing the LFS pointers with the actual content) when LFS shouldn't be used but git-lfs is installed.
> + if not self._need_lfs(ud):
> + cmdprefix = "GIT_LFS_SKIP_SMUDGE=1 "
> + runfetchcmd("%s%s submodule update --recursive --no-fetch" % (cmdprefix, ud.basecmd), d, quiet=True, workdir=ud.destdir)
> def clean(self, ud, d):
> def clean_submodule(ud, url, module, modpath, workdir, d):
> url += ";bareclone=1;nobranch=1"
We've had a lot of churn on this code and it isn't something I use and
fully understand myself so I need to ask some questions to make sure we
get this right this time.
Is "git submodule update --recursive --no-fetch" going to access the
network?
If I understand correctly, you say it shouldn't as things should
already be in DL_DIR. What happens if they're not? Where are the large
files stored in DL_DIR?
From the older comments in the code, it sounds like the smudging was
meant to happen at do_fetch time and this is now being changed to
happen at do_unpack.
Put differently, the fetcher code needs to:
* ensure software manifests are correct and only specifically
referenced things are fetched, no random revisions or accesses outside
of what is listed
* ensure mirroring works correctly and all artefacts needed (including
lfs ones) can be handled by a mirror setting
* be reproducible, the same thing will always be fetched for a given
url/revision
I'd like to be certain this change allows for that and the smudging
doesn't bypass things.
Also, do we have tests covering this from bitbake-selftest?
Cheers,
Richard
next prev parent reply other threads:[~2026-03-09 21:33 UTC|newest]
Thread overview: 6+ messages / expand[flat|nested] mbox.gz Atom feed top
2026-03-09 21:21 [scarthgap][PATCH 0/3] Fix git lfs submodule expansion Michael Siebold
2026-03-09 21:21 ` [PATCH 1/3] bitbake: gitsm: Add clean function Michael Siebold
2026-03-09 21:21 ` [PATCH 2/3] bitbake: fetch2: Fix incorrect lfs parametrization for submodules Michael Siebold
2026-03-09 21:21 ` [PATCH 3/3] bitbake: fetch2: Fix LFS object checkout in submodules Michael Siebold
2026-03-09 21:32 ` Richard Purdie [this message]
2026-03-09 23:14 ` Michael Siebold
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=69ad2307918818d02c42eba53787f404f8fc0fc6.camel@linuxfoundation.org \
--to=richard.purdie@linuxfoundation.org \
--cc=bitbake-devel@lists.openembedded.org \
--cc=michael.siebold@gmail.com \
--cc=philip.lorenz@bmw.de \
--cc=yoann.congal@smile.fr \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.