git.vger.kernel.org archive mirror
 help / color / mirror / Atom feed
From: Philippe Blain <levraiphilippeblain@gmail.com>
To: Glen Choo <chooglen@google.com>, git@vger.kernel.org
Subject: Re: [RFC] Branches with --recurse-submodules
Date: Thu, 11 Nov 2021 22:19:18 -0500	[thread overview]
Message-ID: <bb9c0094-8532-c463-47a2-442b225ad52e@gmail.com> (raw)
In-Reply-To: <kl6lv912uvjv.fsf@chooglen-macbookpro.roam.corp.google.com>

Hi Glen,

Le 2021-11-08 à 17:33, Glen Choo a écrit :
> 
> Original Submodule UX RFC/Discussion:
> https://lore.kernel.org/git/YHofmWcIAidkvJiD@google.com/
> 
> Contributor Summit submodules Notes:
> https://lore.kernel.org/git/nycvar.QRO.7.76.6.2110211148060.56@tvgsbejvaqbjf.bet/
> 
> Submodule UX overhaul updates:
> https://lore.kernel.org/git/?q=Submodule+UX+overhaul+update
> 
> Hi all! Building on Emily’s original RFC, here is a more fleshed out
> vision of how `git {switch,checkout,branch}` will work with
> submodule-native branches.
> 
> The "Background" section reframes the justification and mental model
> behind our proposed workflow in more explicit terms (see "Submodule UX
> RFC:Overview"). The "Design" section presents the rules we are using to
> implement "Submodule UX RFC:Detailed Design", and how certain corner
> cases should be handled.
> 
> I’d appreciate any and all feedback :) In particular, readers may be
> interested in the "dirty worktree" approach behind `git switch`. If
> anything stands out as good, bad or missing, do let us know. Thanks!
> 
> == Background
> 
> The purpose of this effort is to bring the benefits of branches to
> superprojects. In Git, branches are used to name and track progress;
> submodules are used to incorporate other repos. However, because of how
> submodules are tracked by superprojects, submodules usually operate in
> detached HEAD and the benefits of branches are lost. For users
> uncomfortable with detached HEAD, this workflow seems risky and
> unintuitive. Other users may still prefer branches because they can have
> branch reflog and they can be confident that submodule work is being
> tracked by some branch and won’t be gc-ed.
> 
> The main ideas are:
> 
> * there is a single set of branch names that are used throughout the
>    repo tree
> * progress can be made on submodules and/or the superproject without
>    requiring a gitlink update on the superproject
> * the user can switch between branches like they would for a
>    non-submodule-using repo.
> 
> We do not require the branches to move in lockstep, thus this UX may be
> suboptimal for logical monorepos that are implemented as submodules.
> 
> == Design
> 
> This design uses the same branch name in the superproject and
> submodules; a user who sees the branch `topic` in the superproject and
> submodules knows that they are the same logical thing. Commands with
> --recurse-submodules maintain the invariant that branches in the
> superproject and submodules are {read,created,modified,deleted}
> together.
> 
> e.g.
> 
> * `git branch --recurse-submodules topic` should create the branch
>    `topic` in each of the repositories.

I guess for some workflow this would be the good, but for others you might
not need to create submodule branches for each new superproject branch you
create.  I think I pointed that out before; I don't necessarily think that
creating branches in all submodules should *not* be the default behaviour,
but I think that it should be configurable. I mean that if I have 'submodule.recurse'
set to true, I would not like 'git branch topic' to create a 'topic' branch
in each submodule. So I wish I'll be able to add 'branch.recurseSubmodules = false'
to my config (or something similar) to have more granularity in behaviour.

Also, I assume the new behaviour would carry over to 'git checkout -b' and
'git switch -c' ?

> * `git switch --recurse-submodules topic` should checkout the branch
>    `topic` in each of the repositories

Nit: I guess you also include 'git checkout --r topic' here also ?

> 
> In a superproject-submodule relationship there is some ambiguity in what
> ‘checkout the branch `topic`’ should mean (does the submodule use its
> topic branch, or the version recorded in the superproject’s gitlink?).
> Our approach is to preserve existing semantics where reasonable - the
> ref name refers to the superproject’s ref, just as it does without
> --recurse-submodules.
> 
> One wrinkle is that a user can act on submodules _without_ going through
> the superproject (e.g. by cd-ing into the submodule), thus the branch
> tips may not match the expected commits in the superproject or the set
> of submodules branches may not match the set of superproject branches.
> As such, submodule branch names are resolved on a best-effort basis:
> 
> * If the submodule branch commit matches the one in the superproject, we
>    can safely use the submodule branch.

That makes sense.

> * If the branch is in an unexpected state, we either:
> ** Fallback to the version that the user would expect (if it is safe to
>      do so).

What would be 'the version the user would expect' here ? checking out the 'topic' branch
in the submodule, even if it's ahead of the commit recorded in the superproject ?
(it could even be behind, if the submodule branch was manually resetted). Or falling
back to the old behaviour of checking out the commit recorded in the superproject
on a detached HEAD ? I think I would prefer that, with sufficient warning.

> ** Reject the operation (if it is not safe).
> 
> As we expand submodule branches to other commands (merge, rebase,
> reset), the notions of ‘unexpected state’ and ‘safety’ become
> increasingly nebulous and difficult to define because they depend on the
> command being run. To manage this, we will start by supporting submodule
> branching under a limited set of circumstances and try to loosen them in
> the future. We will manage the user’s expectations by warning them if
> Git detects an unexpected state.
> 
> The proposed rules for submodule branching are as follows:
> 
> === Switching _from_ a branch `topic`, i.e. `git {switch,checkout}`
> 
> Check `topic` if each submodule’s worktree is clean (except for
> gitlinks), and has one of the following checked out:
> 
> * `topic`
> * the commit id in the superproject gitlink

I'm not sure what you mean here, if we are switching away from 'topic',
why do we want to checkout 'topic' ? (assuming "out" is missing from your sentence above).

Or maybe you really mean "check" ? But then I'm not sure either what you mean...


Re-reading it, and your next email, maybe that should read:

> === Switching _away from_ a branch `topic`, i.e. `git {switch,checkout} other-branch`
> 
> Checkout `other-branch` if each submodule’s worktree is clean (except for
> gitlinks), and has one of the following checked out:
> 
> * `topic`
> * the commit id in the superproject gitlink at the tip of 'topic'

Is that what you meant ? (that would indeed make sense).

> 
> This allows the user to switch with a dirty worktree (with respect to
> the superproject). We consider this acceptable because the submodule
> commits are tracked by the submodule branch. This is helpful when a user
> needs to switch branches before they are ready to commit to the
> superproject.
> 
> === Switching _to_ a branch `topic`, i.e. `git {switch,checkout} topic`
> 
> Switch to `topic` in the superproject. Then in each submodule, switch to:
> 
> * `topic`, if it exists
> * Otherwise, the commit id in the superproject gitlink (and warn the
>    user that HEAD is detached)
> 
> If the submodule `topic` points to a different commit from the
> superproject gitlink, this will leave the superproject with a dirty
> worktree with respect to the gitlinks. This allows a user to recover
> work if they had previously switched _away from_ "topic".

OK, so you seem to answer my interrogation above about "what is the version
the user would expect ?" with "the commit at the tip of 'topic' in the submodule,
if that branch exists.".

> 
> If a dirty worktree is unacceptable, we may need an option that is
> guaranteed to check out the superproject’s `topic`.

Yes, I would think that should be configurable, maybe something like
'--recurse-submodules=branch' vs '--recurse-submodules=detached' (which
is the actual behaviour). Just thinking out loud here.

> 
> === Creating a branch `topic`, i.e. `git branch topic start-point`
> 
> Check each submodule at the superproject’s `start-point` (not the
> submodule’s `start-point`) for the following:
> 
> * The submodule is initialized (in .git/modules)

The submodule should also be active, no ? Maybe it was cloned before,
so exists in .git/modules, but was then set as inactive (submodule.<name>.active=false)...

> * `topic` is a valid branch name
> 
> If so, create `topic` in the superproject and submodules based on the
> superproject’s `start-point`. Else, do not create any `topic` branches
> and guide the user towards a possible fix:
> 
> * A --force option that will move the branch tip to the commit in the
>    superproject. This will let the user overwrite the history of `topic`.
> * An --ignore option that ignores the existing `topic` branch. If used,
>    `git switch topic` would result in a dirty worktree.
> * (If needed) An --adopt option that creates a new superproject commit
>    that points to the existing submodule `topic` branch. This will let
>    the user checkout `topic` without ending up with a dirty worktree.
> * For uninitialized submodules, prompt them to initialize it via git
>    checkout start-point && git submodule update (we are working to
>    eliminate manual initialization in the long run, so this will become
>    obsolete eventually).

I think if the submodule are not initialized, they should be left alone, without
prompting the user. Projects that use non-optional submodules already instruct
their users to clone with --recurse-submodules or run 'git submodule update --init --recursive' after the clone, so I'm not sure that sort of nagging would be necessary...


Cheers,

Philippe.  

  parent reply	other threads:[~2021-11-12  3:19 UTC|newest]

Thread overview: 11+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2021-11-08 22:33 [RFC] Branches with --recurse-submodules Glen Choo
2021-11-10 18:21 ` Glen Choo
2021-11-10 18:35   ` rsbecker
2021-11-10 19:35     ` Glen Choo
2021-11-10 20:25       ` rsbecker
2021-11-11 18:12         ` Glen Choo
2021-11-12  3:22   ` Philippe Blain
2021-11-12  3:19 ` Philippe Blain [this message]
2021-11-15 19:03   ` Glen Choo
2021-11-23 18:36     ` Philippe Blain
2021-11-23 19:04       ` Glen Choo

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=bb9c0094-8532-c463-47a2-442b225ad52e@gmail.com \
    --to=levraiphilippeblain@gmail.com \
    --cc=chooglen@google.com \
    --cc=git@vger.kernel.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).