* git default behavior seems odd from a Unix command line point of view @ 2009-05-12 15:18 Andrew Schein 2009-05-12 16:12 ` Junio C Hamano 2009-05-12 16:24 ` Jeff King 0 siblings, 2 replies; 8+ messages in thread From: Andrew Schein @ 2009-05-12 15:18 UTC (permalink / raw) To: git Hi all - I am recently working within git after some experience with mercurial. I am observing what I believe to be an odd default behavior from the perspective of UNIX command line tools. I thought I'd share in case this hasn't occurred to git maintainers or in case somebody has developed good workaround practices. ais@ace:bio[1]$ pwd /home/ais/repo/nps/projects/bio ais@ace:bio$ git status # On branch master nothing to commit (working directory clean) ais@ace:bio[1]$ git commit -a # On branch master nothing to commit (working directory clean) ais@ace:bio[1]$ The [1] in my prompt indicates the exit code of the git commands. What I find odd is that even with the -q option, you get this verbose output. Also, you get a non-zero exit status (which I would expect only on a failure such as presence of an unresolved conflict). My git usage is to have a number of small repositories and use a shell script to loop over them and perform a sync with a centralized server. Having all this wordy output on a "no sync necessary" scenario seems counter the desired properties of output only when work is taking place or when an error occurs. Have others developed git practices to sync a bunch or repositories without all this verbose output on a "no change" scenario? Andy -- Andrew I. Schein www.andrewschein.com ^ permalink raw reply [flat|nested] 8+ messages in thread
* Re: git default behavior seems odd from a Unix command line point of view 2009-05-12 15:18 git default behavior seems odd from a Unix command line point of view Andrew Schein @ 2009-05-12 16:12 ` Junio C Hamano 2009-05-12 16:24 ` Andrew Schein 2009-05-12 16:24 ` Jeff King 1 sibling, 1 reply; 8+ messages in thread From: Junio C Hamano @ 2009-05-12 16:12 UTC (permalink / raw) To: Andrew Schein; +Cc: git Andrew Schein <andrew@andrewschein.com> writes: > ais@ace:bio$ git status > # On branch master > nothing to commit (working directory clean) > ais@ace:bio[1]$ git commit -a This is merely a historical accident that you cannot change. "git status" was originally a dual-purpose implementation detail of "git commit" command to: (1) report if there is anything you can commit with its exit code; and (2) if there is something to be committed, show what will and what won't be committed. For the purpose of (1), the calling command "commit" was told by the user to make a commit, so the situation where there is nothing to commit was signalled as an error. The code conceptually looked like: (git-commit.sh) do the preparation if git-status then # message "there is nothing to commit" already given revert the preparation exit 1 fi really make a commit These days, "git commit" has been rewritten and the logic to see if there is anything to commit is built-in, but people still use "git status" primarily for purpose (2), the listing of what's committed and what's left. Oh, and your example is wrong. If you are going to commit with -a, and if you want to see if such a "git commit" has actually something to commit, then you should be giving -a to status as well, like: git status -a && git commit -a Also, unless you ask explicitly, "git commit" will not make an empty commit, so you can always omit the "git status -a &&" part and have "git commit" itself do the checking. ^ permalink raw reply [flat|nested] 8+ messages in thread
* Re: git default behavior seems odd from a Unix command line point of view 2009-05-12 16:12 ` Junio C Hamano @ 2009-05-12 16:24 ` Andrew Schein 2009-05-12 16:34 ` Jeff King 2009-05-12 18:26 ` Daniel Barkalow 0 siblings, 2 replies; 8+ messages in thread From: Andrew Schein @ 2009-05-12 16:24 UTC (permalink / raw) To: Junio C Hamano; +Cc: git Hi Junio - Thanks for your reply giving historical context. The command line examples I gave were intended to give examples of the output rather than my own usage pattern. Here is my actual usage pattern... from a file called "sync.sh" (would love feedback on whether this is the gitly? way to solve my use case): # environment set up occurs before loop, I pull before "pushing" in an attempt to prevent # conflicts from being left on the shared repository. for dir in ./* ; do if [ ! -d $dir ] ; then continue ; fi #not a directory if [ ! -e $dir/.git ] ; then continue ; fi #not a git repo dir=`basename $dir` echo "syncing: $dir" set +e # commit returns an error if there is nothing to commit. (cd ./$dir ; git commit -a) set -e (cd ./$dir ; git pull $UP "$REPO/$dir" master) #pull ssh $HOST "mkdir -p $DEST_CACHE/$LOC/$dir" # these three lines handle "push" rsync -rl --delete ./$dir/.git -e ssh "$DEST:$DEST_CACHE/$LOC/$dir/.git" ssh $DEST "(cd $LOC/$dir ; /tools/bin/git pull $DEST_CACHE/$LOC/$dir/.git master)" done ^ permalink raw reply [flat|nested] 8+ messages in thread
* Re: git default behavior seems odd from a Unix command line point of view 2009-05-12 16:24 ` Andrew Schein @ 2009-05-12 16:34 ` Jeff King 2009-05-12 18:26 ` Daniel Barkalow 1 sibling, 0 replies; 8+ messages in thread From: Jeff King @ 2009-05-12 16:34 UTC (permalink / raw) To: Andrew Schein; +Cc: Junio C Hamano, git On Tue, May 12, 2009 at 12:24:56PM -0400, Andrew Schein wrote: > # environment set up occurs before loop, I pull before "pushing" in an > attempt to prevent > # conflicts from being left on the shared repository. > for dir in ./* ; do > if [ ! -d $dir ] ; then continue ; fi #not a directory > if [ ! -e $dir/.git ] ; then continue ; fi #not a git repo > dir=`basename $dir` > echo "syncing: $dir" > set +e # commit returns an error if there is nothing to commit. > (cd ./$dir ; git commit -a) > set -e One trick to avoid playing with "set -e" is just: thing_whose_error_you_want_ignore || true But that aside, it might be nice _not_ to ignore the result from "commit -a", since it could also be warning you that it failed to correctly commit. You probably want to do: git diff --quiet HEAD || git commit -a That is "either there is nothing to commit, or we succeed in committing everything". That of course has a race condition between the two commands, though (i.e., if you modify files in between). If you want something truly atomic, I think we would need "git commit" to distinguish error codes between "I had nothing to commit" and "an error occurred" (returning "1" right now could mean either). > (cd ./$dir ; git pull $UP "$REPO/$dir" master) #pull > ssh $HOST "mkdir -p $DEST_CACHE/$LOC/$dir" # these three lines > handle "push" > rsync -rl --delete ./$dir/.git -e ssh "$DEST:$DEST_CACHE/$LOC/$dir/.git" > ssh $DEST "(cd $LOC/$dir ; /tools/bin/git pull > $DEST_CACHE/$LOC/$dir/.git master)" Pushing directly should be a lot more efficient in the case of repacking (git will realize that the remote already has certain objects, but if the filenames change, rsync will not). -Peff ^ permalink raw reply [flat|nested] 8+ messages in thread
* Re: git default behavior seems odd from a Unix command line point of view 2009-05-12 16:24 ` Andrew Schein 2009-05-12 16:34 ` Jeff King @ 2009-05-12 18:26 ` Daniel Barkalow 2009-05-12 20:05 ` Andrew Schein 1 sibling, 1 reply; 8+ messages in thread From: Daniel Barkalow @ 2009-05-12 18:26 UTC (permalink / raw) To: Andrew Schein; +Cc: Junio C Hamano, git On Tue, 12 May 2009, Andrew Schein wrote: > Hi Junio - > > Thanks for your reply giving historical context. The command line > examples I gave were intended to give examples of the output rather > than my own usage pattern. > > Here is my actual usage pattern... from a file called "sync.sh" > (would love feedback on whether this is the gitly? way to solve my use > case): What *is* your use case? What you're doing seems nuts to me (like, you're going to send out files with this script that someone is in the middle of editting), but I don't know what you're trying to do. > # environment set up occurs before loop, I pull before "pushing" in an > attempt to prevent > # conflicts from being left on the shared repository. > for dir in ./* ; do > if [ ! -d $dir ] ; then continue ; fi #not a directory > if [ ! -e $dir/.git ] ; then continue ; fi #not a git repo > dir=`basename $dir` > echo "syncing: $dir" > set +e # commit returns an error if there is nothing to commit. > (cd ./$dir ; git commit -a) > set -e > (cd ./$dir ; git pull $UP "$REPO/$dir" master) #pull > ssh $HOST "mkdir -p $DEST_CACHE/$LOC/$dir" # these three lines > handle "push" > rsync -rl --delete ./$dir/.git -e ssh "$DEST:$DEST_CACHE/$LOC/$dir/.git" > ssh $DEST "(cd $LOC/$dir ; /tools/bin/git pull > $DEST_CACHE/$LOC/$dir/.git master)" > > done What are you trying to avoid by not using "git push"? Why are you committing whatever changes happen to be in working directories? Are you intending to be able to handle non-trivial merges? -Daniel *This .sig left intentionally blank* ^ permalink raw reply [flat|nested] 8+ messages in thread
* Re: git default behavior seems odd from a Unix command line point of view 2009-05-12 18:26 ` Daniel Barkalow @ 2009-05-12 20:05 ` Andrew Schein 2009-05-12 20:50 ` Daniel Barkalow 0 siblings, 1 reply; 8+ messages in thread From: Andrew Schein @ 2009-05-12 20:05 UTC (permalink / raw) To: Daniel Barkalow, git > What *is* your use case? What you're doing seems nuts to me (like, you're > going to send out files with this script that someone is in the middle of > editting), but I don't know what you're trying to do. I am new to git... so my first instinct is to try to reproduce a work flow that I know works with mercurial setup. It is possible that the concepts don't translate correctly. Here goes... I have a bunch of separate project-related repositories. There are very few users of the system. Most of the time I am the only user. I want a system for syncing my local repositories to a single shared repository. For example some days I work on my laptop, and some days from my desktop. A third "shared/public" repository "on campus" serves as an always available repository that anyone I collaborate with can pull from. Also it is backed up, and for this reason I designate it the "shared" version. So the purpose of the sync.sh script is to synchronize the personal laptop/desktop repository to the on-campus version. Something I have learned from using mercurial in industry is that when somebody messes up a "public repo" with conflicts they frequently don't clean up the mess. This can be a sign that they have not learned the lessons of cleanliness rather than ill intent. Otherwise (and similarly) this messiness can be caused from not noticing that they have left a mess. The motivation of having a sync script that is run on each user's local repository is to decrease the likelihood of a mess. This is achieved by first pulling from the common repository and resolving conflicts _before_ "pushing" (note quotations) their changes to the common repository. There is a possibility of a race condition that leaves a conflict on the shared repository, however the risk is diminished. Finally, I use "push" in quotes because actually the script uses only uses the pull command. This prevents proliferation of branches on the shared repository. Is there a better way to achieve this in git than the sync.sh script I sent around? Thanks, Andy -- Andrew I. Schein www.andrewschein.com ^ permalink raw reply [flat|nested] 8+ messages in thread
* Re: git default behavior seems odd from a Unix command line point of view 2009-05-12 20:05 ` Andrew Schein @ 2009-05-12 20:50 ` Daniel Barkalow 0 siblings, 0 replies; 8+ messages in thread From: Daniel Barkalow @ 2009-05-12 20:50 UTC (permalink / raw) To: Andrew Schein; +Cc: git On Tue, 12 May 2009, Andrew Schein wrote: > > What *is* your use case? What you're doing seems nuts to me (like, you're > > going to send out files with this script that someone is in the middle of > > editting), but I don't know what you're trying to do. > > I am new to git... so my first instinct is to try to reproduce a work > flow that I know works with mercurial setup. It is possible that the > concepts don't translate correctly. Here goes... > > I have a bunch of separate project-related repositories. There are > very few users of the system. Most of the time I am the only user. I > want a system for syncing my local repositories to a single shared > repository. For example some days I work on my laptop, and some days > from my desktop. A third "shared/public" repository "on campus" > serves as an always available repository that anyone I collaborate > with can pull from. Also it is backed up, and for this reason I > designate it the "shared" version. So the purpose of the sync.sh > script is to synchronize the personal laptop/desktop repository to the > on-campus version. In general, you should probably not have the script run "commit", but run it by hand whenever you've finished making changes (and probably done at least a quick syntax test, if possible). Otherwise, you'll often get the situation where you get distracted halfway through modifying a line, save something by habit, and then find this state getting sent to the system that other people pull from, and they'll find that they can't build your latest code. > Something I have learned from using mercurial in industry is that when > somebody messes up a "public repo" with conflicts they frequently > don't clean up the mess. This can be a sign that they have not > learned the lessons of cleanliness rather than ill intent. Otherwise > (and similarly) this messiness can be caused from not noticing that > they have left a mess. In the normal process of pushing changes to a shared repository, there is no possibility of leaving conflicts, because there's no way to get git to attempt a merge on push; the push simply fails, and you then pull, resolve any conflicts, commit the merge, and try again at pushing. > The motivation of having a sync script that is run on each user's > local repository is to decrease the likelihood of a mess. This is > achieved by first pulling from the common repository and resolving > conflicts _before_ "pushing" (note quotations) their changes to the > common repository. There is a possibility of a race condition that > leaves a conflict on the shared repository, however the risk is > diminished. > > Finally, I use "push" in quotes because actually the script uses only > uses the pull command. This prevents proliferation of branches on the > shared repository. It's better to configure the push refspec to do what you want, which seems to be "refs/heads/master:refs/heads/master", and use the fact that git push actually does what you're trying to do in a race-free and efficient way. Also, the "refs/heads/*:refs/heads/*" refspec is only different in the case where you use branches other than "master", and seems like it would be useful for your use case; if you actually want to send everything you do to the shared location, you want to have all of the branches you've made. (That is, you get one branch on the shared repository per distinct name of a branch that you use on a local repository, not one "master" for each local repository's "master") -Daniel *This .sig left intentionally blank* ^ permalink raw reply [flat|nested] 8+ messages in thread
* Re: git default behavior seems odd from a Unix command line point of view 2009-05-12 15:18 git default behavior seems odd from a Unix command line point of view Andrew Schein 2009-05-12 16:12 ` Junio C Hamano @ 2009-05-12 16:24 ` Jeff King 1 sibling, 0 replies; 8+ messages in thread From: Jeff King @ 2009-05-12 16:24 UTC (permalink / raw) To: Andrew Schein; +Cc: git On Tue, May 12, 2009 at 11:18:25AM -0400, Andrew Schein wrote: > The [1] in my prompt indicates the exit code of the git commands. What > I find odd is that even with the -q option, you get this verbose > output. Also, you get a non-zero exit status (which I would expect > only on a failure such as presence of an unresolved conflict). My git > usage is to have a number of small repositories and use a shell script > to loop over them and perform a sync with a centralized server. > Having all this wordy output on a "no sync necessary" scenario seems > counter the desired properties of output only when work is taking > place or when an error occurs. > > Have others developed git practices to sync a bunch or repositories > without all this verbose output on a "no change" scenario? Yes, I have such a script. I check: git ls-files -m -o -d --exclude-standard --directory --no-empty-directory If it produces any output, then there is something to commit (either a change in a tracked file, or an untracked file that might need to be added). I also do a fetch and check to see if we have any commits that need to be merged: git rev-list master..origin or any commits that we need to push: git rev-list origin..master (actually, it is a bit more complicated, since "master" and "origin" are just convention; I actually parse the config to find the branch pairs). -Peff ^ permalink raw reply [flat|nested] 8+ messages in thread
end of thread, other threads:[~2009-05-12 20:50 UTC | newest] Thread overview: 8+ messages (download: mbox.gz follow: Atom feed -- links below jump to the message on this page -- 2009-05-12 15:18 git default behavior seems odd from a Unix command line point of view Andrew Schein 2009-05-12 16:12 ` Junio C Hamano 2009-05-12 16:24 ` Andrew Schein 2009-05-12 16:34 ` Jeff King 2009-05-12 18:26 ` Daniel Barkalow 2009-05-12 20:05 ` Andrew Schein 2009-05-12 20:50 ` Daniel Barkalow 2009-05-12 16:24 ` Jeff King
This is a public inbox, see mirroring instructions for how to clone and mirror all data and code used for this inbox; as well as URLs for NNTP newsgroup(s).