* Rename edge case... @ 2012-11-09 9:10 John Szakmeister 2012-11-09 9:27 ` Tomas Carnecky 2012-11-09 16:09 ` Jeff King 0 siblings, 2 replies; 7+ messages in thread From: John Szakmeister @ 2012-11-09 9:10 UTC (permalink / raw) To: git I've been browsing StackOverflow answering git-related questions, and ran across this one: <http://stackoverflow.com/questions/13300675/git-merge-rename-conflict> It's a bit of an interesting situation. The user did a couple of renames in a branch: foo.txt => fooOld.txt fooNew.txt => foo.txt Meanwhile, master had an update to fooNew.txt. When the user tried to merge master to the branch, it gave a merge conflict saying fooNew.txt was deleted, but master tried to update it. I was a bit surprised that git didn't follow the rename here, though I do understand why: git only sees it as a rename if the source disappears completely. So I played locally with a few ideas, and was surprised to find out that even breaking up the two renames into two separate commits git still didn't follow it. I'm just curious--I don't run into this often myself--but is there a good strategy for dealing with this that avoids the conflict? Thanks! -John ^ permalink raw reply [flat|nested] 7+ messages in thread
* Re: Rename edge case... 2012-11-09 9:10 Rename edge case John Szakmeister @ 2012-11-09 9:27 ` Tomas Carnecky 2012-11-09 10:25 ` John Szakmeister 2012-11-09 16:09 ` Jeff King 1 sibling, 1 reply; 7+ messages in thread From: Tomas Carnecky @ 2012-11-09 9:27 UTC (permalink / raw) To: John Szakmeister, git On Fri, 09 Nov 2012 04:10:31 -0500, John Szakmeister <john@szakmeister.net> wrote: > I've been browsing StackOverflow answering git-related questions, and > ran across this one: > <http://stackoverflow.com/questions/13300675/git-merge-rename-conflict> > > It's a bit of an interesting situation. The user did a couple of > renames in a branch: > foo.txt => fooOld.txt > fooNew.txt => foo.txt > > Meanwhile, master had an update to fooNew.txt. When the user tried to > merge master to the branch, it gave a merge conflict saying fooNew.txt > was deleted, but master tried to update it. > > I was a bit surprised that git didn't follow the rename here, though I > do understand why: git only sees it as a rename if the source > disappears completely. So I played locally with a few ideas, and was > surprised to find out that even breaking up the two renames into two > separate commits git still didn't follow it. > > I'm just curious--I don't run into this often myself--but is there a > good strategy for dealing with this that avoids the conflict? When merging two branches, git only looks at the tips. It doesn't inspect their histories to see how the files were moved around. So i doesn't matter whether you rename the files in a single commit or multiple commits. The resulting tree is always the same. tom ^ permalink raw reply [flat|nested] 7+ messages in thread
* Re: Rename edge case... 2012-11-09 9:27 ` Tomas Carnecky @ 2012-11-09 10:25 ` John Szakmeister 2012-11-09 10:30 ` Nguyen Thai Ngoc Duy 2012-11-09 13:17 ` Johannes Sixt 0 siblings, 2 replies; 7+ messages in thread From: John Szakmeister @ 2012-11-09 10:25 UTC (permalink / raw) To: Tomas Carnecky; +Cc: git On Fri, Nov 9, 2012 at 4:27 AM, Tomas Carnecky <tomas.carnecky@gmail.com> wrote: [snip] > When merging two branches, git only looks at the tips. It doesn't inspect > their histories to see how the files were moved around. So i doesn't matter > whether you rename the files in a single commit or multiple commits. The > resulting tree is always the same. I guess I figured that when I saw the final result, but didn't know if there was a way to coax Git into doing a better job here. I guess not though. At least it's a situation that doesn't come up often. -John ^ permalink raw reply [flat|nested] 7+ messages in thread
* Re: Rename edge case... 2012-11-09 10:25 ` John Szakmeister @ 2012-11-09 10:30 ` Nguyen Thai Ngoc Duy 2012-11-09 13:17 ` Johannes Sixt 1 sibling, 0 replies; 7+ messages in thread From: Nguyen Thai Ngoc Duy @ 2012-11-09 10:30 UTC (permalink / raw) To: John Szakmeister; +Cc: Tomas Carnecky, git On Fri, Nov 9, 2012 at 5:25 PM, John Szakmeister <john@szakmeister.net> wrote: > On Fri, Nov 9, 2012 at 4:27 AM, Tomas Carnecky <tomas.carnecky@gmail.com> wrote: > [snip] >> When merging two branches, git only looks at the tips. It doesn't inspect >> their histories to see how the files were moved around. So i doesn't matter >> whether you rename the files in a single commit or multiple commits. The >> resulting tree is always the same. > > I guess I figured that when I saw the final result, but didn't know if > there was a way to coax Git into doing a better job here. I guess not > though. There is not (yet). There were a few patches around that allow users to override git's automatic renames. You can search the mail archive. I think the keywords are "rename cache". Thanks for reminding me for putting a higher priority on that. -- Duy ^ permalink raw reply [flat|nested] 7+ messages in thread
* Re: Rename edge case... 2012-11-09 10:25 ` John Szakmeister 2012-11-09 10:30 ` Nguyen Thai Ngoc Duy @ 2012-11-09 13:17 ` Johannes Sixt 1 sibling, 0 replies; 7+ messages in thread From: Johannes Sixt @ 2012-11-09 13:17 UTC (permalink / raw) To: John Szakmeister; +Cc: Tomas Carnecky, git Am 11/9/2012 11:25, schrieb John Szakmeister: > On Fri, Nov 9, 2012 at 4:27 AM, Tomas Carnecky <tomas.carnecky@gmail.com> wrote: > [snip] >> When merging two branches, git only looks at the tips. It doesn't inspect >> their histories to see how the files were moved around. So i doesn't matter >> whether you rename the files in a single commit or multiple commits. The >> resulting tree is always the same. > > I guess I figured that when I saw the final result, but didn't know if > there was a way to coax Git into doing a better job here. If the renames are split in two commits, you can merge the first, and then the second on top of the result. -- Hannes ^ permalink raw reply [flat|nested] 7+ messages in thread
* Re: Rename edge case... 2012-11-09 9:10 Rename edge case John Szakmeister 2012-11-09 9:27 ` Tomas Carnecky @ 2012-11-09 16:09 ` Jeff King 2012-11-10 2:01 ` John Szakmeister 1 sibling, 1 reply; 7+ messages in thread From: Jeff King @ 2012-11-09 16:09 UTC (permalink / raw) To: John Szakmeister; +Cc: git On Fri, Nov 09, 2012 at 04:10:31AM -0500, John Szakmeister wrote: > I've been browsing StackOverflow answering git-related questions, and > ran across this one: > <http://stackoverflow.com/questions/13300675/git-merge-rename-conflict> > > It's a bit of an interesting situation. The user did a couple of > renames in a branch: > foo.txt => fooOld.txt > fooNew.txt => foo.txt > > Meanwhile, master had an update to fooNew.txt. When the user tried to > merge master to the branch, it gave a merge conflict saying fooNew.txt > was deleted, but master tried to update it. > > I was a bit surprised that git didn't follow the rename here, though I > do understand why: git only sees it as a rename if the source > disappears completely. Right. If the source didn't go away, it would be a copy. We can do copy detection, but it is not quite as obvious what a merge should do with a copy (apply the change to the original? To the copy? In both places? You would really want hunk-level copy detection for it to make any sense). Usually git deals with this double-rename case through the use of "break" or "rewrite" detection. We notice that the old "foo.txt" and the new "foo.txt" do not look very much like each other, and break the modification apart into an add and a delete. That makes each side eligible for rename detection, and we can end up finding the pairs of renames above. So in theory it just as simple as a one-liner to turn on break-detection in merge-recursive. Sadly, that only reveals more issues with how merge-recursive handles renames. See this thread, which has pointers to the breakages at the end: http://thread.gmane.org/gmane.comp.version-control.git/169944 I've become convinced that the best way forward with merge-recursive is to scrap and rewrite it. It tries to do things in a muddled order, which makes it very brittle to changes like this. I think it needs to have an internal representation of the tree that can represent all of the conflicts, and then follow a few simple phases: 1. "structural" 3-way merge handling renames, breaks, typechanges, etc. Each path in tree might show things like D/F conflicts, or it might show content-level merges that still need to happen, even if the content from those merges is not coming from the same paths in the source trees. 2. Resolve content-level 3-way merges at each path. 3. Compare the proposed tree to the working tree and list any problems (e.g., untracked files or local modifications that will be overwritten). Right now it tries to do these things interleaved as it processes paths, and as a result we've had many bugs (e.g., the content-level merge conflating the content originally at a path and something that was renamed into place, and missing corner cases where we actually overwrite untracked files that should be considered precious). But that is just off the top of my head. I haven't looked at the topic in quite a while (and I haven't even started working on any such rewrite). > So I played locally with a few ideas, and was surprised to find out > that even breaking up the two renames into two separate commits git > still didn't follow it. Right, because the merge only looks at the end points. Try doing a "diff -M" between your endpoints with and without "-B". We do not have any double-renames in git.git, but you can find "-B" helping a similar case: most of a file's content is moved elsewhere, but some small amount remains. For example, try this in git.git, with and without -B: git show -M --stat --summary --patch 043a449 It finds the rename only with "-B", which would help a merge (it also makes the diff shorter and more readable, as you can see what was changed as the content migrated to the new file). -Peff ^ permalink raw reply [flat|nested] 7+ messages in thread
* Re: Rename edge case... 2012-11-09 16:09 ` Jeff King @ 2012-11-10 2:01 ` John Szakmeister 0 siblings, 0 replies; 7+ messages in thread From: John Szakmeister @ 2012-11-10 2:01 UTC (permalink / raw) To: Jeff King; +Cc: git On Fri, Nov 9, 2012 at 11:09 AM, Jeff King <peff@peff.net> wrote: [snip] > Right. If the source didn't go away, it would be a copy. We can do copy > detection, but it is not quite as obvious what a merge should do with a > copy (apply the change to the original? To the copy? In both places? You > would really want hunk-level copy detection for it to make any sense). Yeah, I wasn't advocating that. More along the lines of what you're talking about below... > Usually git deals with this double-rename case through the use of > "break" or "rewrite" detection. We notice that the old "foo.txt" and the > new "foo.txt" do not look very much like each other, and break the > modification apart into an add and a delete. That makes each side > eligible for rename detection, and we can end up finding the pairs of > renames above. I did try using the -B option, and it did detect that foo.txt was renamed to fooOld.txt, but it didn't show fooNew.txt being renamed to foo.txt. I'm running git 1.7.12.3. It could be that 1.8.0 does better, but I haven't tried. > So in theory it just as simple as a one-liner to turn on break-detection > in merge-recursive. Sadly, that only reveals more issues with how > merge-recursive handles renames. See this thread, which has pointers to > the breakages at the end: > > http://thread.gmane.org/gmane.comp.version-control.git/169944 Thank you. I'll definitely read up on this. > I've become convinced that the best way forward with merge-recursive is > to scrap and rewrite it. It tries to do things in a muddled order, which > makes it very brittle to changes like this. I think it needs to have an > internal representation of the tree that can represent all of the > conflicts, and then follow a few simple phases: > > 1. "structural" 3-way merge handling renames, breaks, typechanges, > etc. Each path in tree might show things like D/F conflicts, or it > might show content-level merges that still need to happen, even if > the content from those merges is not coming from the same paths in > the source trees. > > 2. Resolve content-level 3-way merges at each path. > > 3. Compare the proposed tree to the working tree and list any problems > (e.g., untracked files or local modifications that will be > overwritten). > > Right now it tries to do these things interleaved as it processes paths, > and as a result we've had many bugs (e.g., the content-level merge > conflating the content originally at a path and something that was > renamed into place, and missing corner cases where we actually overwrite > untracked files that should be considered precious). > > But that is just off the top of my head. I haven't looked at the topic > in quite a while (and I haven't even started working on any such > rewrite). That certainly sounds like a better approach. >> So I played locally with a few ideas, and was surprised to find out >> that even breaking up the two renames into two separate commits git >> still didn't follow it. > > Right, because the merge only looks at the end points. Try doing a > "diff -M" between your endpoints with and without "-B". We do not have > any double-renames in git.git, but you can find "-B" helping a similar > case: most of a file's content is moved elsewhere, but some small amount > remains. For example, try this in git.git, with and without -B: > > git show -M --stat --summary --patch 043a449 > > It finds the rename only with "-B", which would help a merge (it also > makes the diff shorter and more readable, as you can see what was > changed as the content migrated to the new file). I've played with the -B option before, and it's definitely nice in certain cases. Thank you for taking the time to write all this up. It was very informative! -John ^ permalink raw reply [flat|nested] 7+ messages in thread
end of thread, other threads:[~2012-11-10 2:01 UTC | newest] Thread overview: 7+ messages (download: mbox.gz follow: Atom feed -- links below jump to the message on this page -- 2012-11-09 9:10 Rename edge case John Szakmeister 2012-11-09 9:27 ` Tomas Carnecky 2012-11-09 10:25 ` John Szakmeister 2012-11-09 10:30 ` Nguyen Thai Ngoc Duy 2012-11-09 13:17 ` Johannes Sixt 2012-11-09 16:09 ` Jeff King 2012-11-10 2:01 ` John Szakmeister
This is a public inbox, see mirroring instructions for how to clone and mirror all data and code used for this inbox; as well as URLs for NNTP newsgroup(s).