From: Richard Purdie <richard.purdie@linuxfoundation.org>
To: Michael Siebold <michael.siebold@gmail.com>,
Yoann Congal <yoann.congal@smile.fr>
Cc: bitbake-devel@lists.openembedded.org,
Philip Lorenz <philip.lorenz@bmw.de>
Subject: Re: [PATCH 3/3] bitbake: fetch2: Fix LFS object checkout in submodules
Date: Mon, 09 Mar 2026 21:32:58 +0000 [thread overview]
Message-ID: <69ad2307918818d02c42eba53787f404f8fc0fc6.camel@linuxfoundation.org> (raw)
In-Reply-To: <20260309212125.3172717-4-michael.siebold@gmail.com>
On Mon, 2026-03-09 at 14:21 -0700, Michael Siebold wrote:
> From: Philip Lorenz <philip.lorenz@bmw.de>
>
> Skipping smudging prevents the LFS objects from replacing their
> placeholder files when `git submodule update` actually checks out the
> target revision in the submodule. Smudging cannot happen earlier as the
> clone stored in `.git/modules` is bare.
>
> This should be fine as long as all LFS objects are available in the
> download cache (which they are after the other fixes are applied).
>
> (Bitbake rev: d270e33a07c50bb9c08861cf9a6dc51e1fd2d874)
>
> Upstream-Status: Backport [from commit 3eeac69385]
>
> Signed-off-by: Philip Lorenz <philip.lorenz@bmw.de>
> Signed-off-by: Richard Purdie <richard.purdie@linuxfoundation.org>
> (cherry picked from commit 3eeac69385e8f29a08d022a17b28b5d504deed66)
> Signed-off-by: Michael Siebold <michael.siebold@gmail.com>
> ---
> bitbake/lib/bb/fetch2/gitsm.py | 11 +++++------
> 1 file changed, 5 insertions(+), 6 deletions(-)
>
> diff --git a/bitbake/lib/bb/fetch2/gitsm.py b/bitbake/lib/bb/fetch2/gitsm.py
> index 5c98991480..ef19053330 100644
> --- a/bitbake/lib/bb/fetch2/gitsm.py
> +++ b/bitbake/lib/bb/fetch2/gitsm.py
> @@ -243,12 +243,11 @@ class GitSM(Git):
> ret = self.process_submodules(ud, ud.destdir, unpack_submodules, d)
>
> if not ud.bareclone and ret:
> - # All submodules should already be downloaded and configured in the tree. This simply
> - # sets up the configuration and checks out the files. The main project config should
> - # remain unmodified, and no download from the internet should occur. As such, lfs smudge
> - # should also be skipped as these files were already smudged in the fetch stage if lfs
> - # was enabled.
> - runfetchcmd("GIT_LFS_SKIP_SMUDGE=1 %s submodule update --recursive --no-fetch" % (ud.basecmd), d, quiet=True, workdir=ud.destdir)
> + cmdprefix = ""
> + # Avoid LFS smudging (replacing the LFS pointers with the actual content) when LFS shouldn't be used but git-lfs is installed.
> + if not self._need_lfs(ud):
> + cmdprefix = "GIT_LFS_SKIP_SMUDGE=1 "
> + runfetchcmd("%s%s submodule update --recursive --no-fetch" % (cmdprefix, ud.basecmd), d, quiet=True, workdir=ud.destdir)
> def clean(self, ud, d):
> def clean_submodule(ud, url, module, modpath, workdir, d):
> url += ";bareclone=1;nobranch=1"
We've had a lot of churn on this code and it isn't something I use and
fully understand myself so I need to ask some questions to make sure we
get this right this time.
Is "git submodule update --recursive --no-fetch" going to access the
network?
If I understand correctly, you say it shouldn't as things should
already be in DL_DIR. What happens if they're not? Where are the large
files stored in DL_DIR?
From the older comments in the code, it sounds like the smudging was
meant to happen at do_fetch time and this is now being changed to
happen at do_unpack.
Put differently, the fetcher code needs to:
* ensure software manifests are correct and only specifically
referenced things are fetched, no random revisions or accesses outside
of what is listed
* ensure mirroring works correctly and all artefacts needed (including
lfs ones) can be handled by a mirror setting
* be reproducible, the same thing will always be fetched for a given
url/revision
I'd like to be certain this change allows for that and the smudging
doesn't bypass things.
Also, do we have tests covering this from bitbake-selftest?
Cheers,
Richard
next prev parent reply other threads:[~2026-03-09 21:33 UTC|newest]
Thread overview: 6+ messages / expand[flat|nested] mbox.gz Atom feed top
2026-03-09 21:21 [scarthgap][PATCH 0/3] Fix git lfs submodule expansion Michael Siebold
2026-03-09 21:21 ` [PATCH 1/3] bitbake: gitsm: Add clean function Michael Siebold
2026-03-09 21:21 ` [PATCH 2/3] bitbake: fetch2: Fix incorrect lfs parametrization for submodules Michael Siebold
2026-03-09 21:21 ` [PATCH 3/3] bitbake: fetch2: Fix LFS object checkout in submodules Michael Siebold
2026-03-09 21:32 ` Richard Purdie [this message]
2026-03-09 23:14 ` Michael Siebold
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=69ad2307918818d02c42eba53787f404f8fc0fc6.camel@linuxfoundation.org \
--to=richard.purdie@linuxfoundation.org \
--cc=bitbake-devel@lists.openembedded.org \
--cc=michael.siebold@gmail.com \
--cc=philip.lorenz@bmw.de \
--cc=yoann.congal@smile.fr \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox