* Rename edge case...
@ 2012-11-09 9:10 John Szakmeister
2012-11-09 9:27 ` Tomas Carnecky
2012-11-09 16:09 ` Jeff King
0 siblings, 2 replies; 7+ messages in thread
From: John Szakmeister @ 2012-11-09 9:10 UTC (permalink / raw)
To: git
I've been browsing StackOverflow answering git-related questions, and
ran across this one:
<http://stackoverflow.com/questions/13300675/git-merge-rename-conflict>
It's a bit of an interesting situation. The user did a couple of
renames in a branch:
foo.txt => fooOld.txt
fooNew.txt => foo.txt
Meanwhile, master had an update to fooNew.txt. When the user tried to
merge master to the branch, it gave a merge conflict saying fooNew.txt
was deleted, but master tried to update it.
I was a bit surprised that git didn't follow the rename here, though I
do understand why: git only sees it as a rename if the source
disappears completely. So I played locally with a few ideas, and was
surprised to find out that even breaking up the two renames into two
separate commits git still didn't follow it.
I'm just curious--I don't run into this often myself--but is there a
good strategy for dealing with this that avoids the conflict?
Thanks!
-John
^ permalink raw reply [flat|nested] 7+ messages in thread
* Re: Rename edge case...
2012-11-09 9:10 Rename edge case John Szakmeister
@ 2012-11-09 9:27 ` Tomas Carnecky
2012-11-09 10:25 ` John Szakmeister
2012-11-09 16:09 ` Jeff King
1 sibling, 1 reply; 7+ messages in thread
From: Tomas Carnecky @ 2012-11-09 9:27 UTC (permalink / raw)
To: John Szakmeister, git
On Fri, 09 Nov 2012 04:10:31 -0500, John Szakmeister <john@szakmeister.net> wrote:
> I've been browsing StackOverflow answering git-related questions, and
> ran across this one:
> <http://stackoverflow.com/questions/13300675/git-merge-rename-conflict>
>
> It's a bit of an interesting situation. The user did a couple of
> renames in a branch:
> foo.txt => fooOld.txt
> fooNew.txt => foo.txt
>
> Meanwhile, master had an update to fooNew.txt. When the user tried to
> merge master to the branch, it gave a merge conflict saying fooNew.txt
> was deleted, but master tried to update it.
>
> I was a bit surprised that git didn't follow the rename here, though I
> do understand why: git only sees it as a rename if the source
> disappears completely. So I played locally with a few ideas, and was
> surprised to find out that even breaking up the two renames into two
> separate commits git still didn't follow it.
>
> I'm just curious--I don't run into this often myself--but is there a
> good strategy for dealing with this that avoids the conflict?
When merging two branches, git only looks at the tips. It doesn't inspect
their histories to see how the files were moved around. So i doesn't matter
whether you rename the files in a single commit or multiple commits. The
resulting tree is always the same.
tom
^ permalink raw reply [flat|nested] 7+ messages in thread
* Re: Rename edge case...
2012-11-09 9:27 ` Tomas Carnecky
@ 2012-11-09 10:25 ` John Szakmeister
2012-11-09 10:30 ` Nguyen Thai Ngoc Duy
2012-11-09 13:17 ` Johannes Sixt
0 siblings, 2 replies; 7+ messages in thread
From: John Szakmeister @ 2012-11-09 10:25 UTC (permalink / raw)
To: Tomas Carnecky; +Cc: git
On Fri, Nov 9, 2012 at 4:27 AM, Tomas Carnecky <tomas.carnecky@gmail.com> wrote:
[snip]
> When merging two branches, git only looks at the tips. It doesn't inspect
> their histories to see how the files were moved around. So i doesn't matter
> whether you rename the files in a single commit or multiple commits. The
> resulting tree is always the same.
I guess I figured that when I saw the final result, but didn't know if
there was a way to coax Git into doing a better job here. I guess not
though. At least it's a situation that doesn't come up often.
-John
^ permalink raw reply [flat|nested] 7+ messages in thread
* Re: Rename edge case...
2012-11-09 10:25 ` John Szakmeister
@ 2012-11-09 10:30 ` Nguyen Thai Ngoc Duy
2012-11-09 13:17 ` Johannes Sixt
1 sibling, 0 replies; 7+ messages in thread
From: Nguyen Thai Ngoc Duy @ 2012-11-09 10:30 UTC (permalink / raw)
To: John Szakmeister; +Cc: Tomas Carnecky, git
On Fri, Nov 9, 2012 at 5:25 PM, John Szakmeister <john@szakmeister.net> wrote:
> On Fri, Nov 9, 2012 at 4:27 AM, Tomas Carnecky <tomas.carnecky@gmail.com> wrote:
> [snip]
>> When merging two branches, git only looks at the tips. It doesn't inspect
>> their histories to see how the files were moved around. So i doesn't matter
>> whether you rename the files in a single commit or multiple commits. The
>> resulting tree is always the same.
>
> I guess I figured that when I saw the final result, but didn't know if
> there was a way to coax Git into doing a better job here. I guess not
> though.
There is not (yet). There were a few patches around that allow users
to override git's automatic renames. You can search the mail archive.
I think the keywords are "rename cache". Thanks for reminding me for
putting a higher priority on that.
--
Duy
^ permalink raw reply [flat|nested] 7+ messages in thread
* Re: Rename edge case...
2012-11-09 10:25 ` John Szakmeister
2012-11-09 10:30 ` Nguyen Thai Ngoc Duy
@ 2012-11-09 13:17 ` Johannes Sixt
1 sibling, 0 replies; 7+ messages in thread
From: Johannes Sixt @ 2012-11-09 13:17 UTC (permalink / raw)
To: John Szakmeister; +Cc: Tomas Carnecky, git
Am 11/9/2012 11:25, schrieb John Szakmeister:
> On Fri, Nov 9, 2012 at 4:27 AM, Tomas Carnecky <tomas.carnecky@gmail.com> wrote:
> [snip]
>> When merging two branches, git only looks at the tips. It doesn't inspect
>> their histories to see how the files were moved around. So i doesn't matter
>> whether you rename the files in a single commit or multiple commits. The
>> resulting tree is always the same.
>
> I guess I figured that when I saw the final result, but didn't know if
> there was a way to coax Git into doing a better job here.
If the renames are split in two commits, you can merge the first, and then
the second on top of the result.
-- Hannes
^ permalink raw reply [flat|nested] 7+ messages in thread
* Re: Rename edge case...
2012-11-09 9:10 Rename edge case John Szakmeister
2012-11-09 9:27 ` Tomas Carnecky
@ 2012-11-09 16:09 ` Jeff King
2012-11-10 2:01 ` John Szakmeister
1 sibling, 1 reply; 7+ messages in thread
From: Jeff King @ 2012-11-09 16:09 UTC (permalink / raw)
To: John Szakmeister; +Cc: git
On Fri, Nov 09, 2012 at 04:10:31AM -0500, John Szakmeister wrote:
> I've been browsing StackOverflow answering git-related questions, and
> ran across this one:
> <http://stackoverflow.com/questions/13300675/git-merge-rename-conflict>
>
> It's a bit of an interesting situation. The user did a couple of
> renames in a branch:
> foo.txt => fooOld.txt
> fooNew.txt => foo.txt
>
> Meanwhile, master had an update to fooNew.txt. When the user tried to
> merge master to the branch, it gave a merge conflict saying fooNew.txt
> was deleted, but master tried to update it.
>
> I was a bit surprised that git didn't follow the rename here, though I
> do understand why: git only sees it as a rename if the source
> disappears completely.
Right. If the source didn't go away, it would be a copy. We can do copy
detection, but it is not quite as obvious what a merge should do with a
copy (apply the change to the original? To the copy? In both places? You
would really want hunk-level copy detection for it to make any sense).
Usually git deals with this double-rename case through the use of
"break" or "rewrite" detection. We notice that the old "foo.txt" and the
new "foo.txt" do not look very much like each other, and break the
modification apart into an add and a delete. That makes each side
eligible for rename detection, and we can end up finding the pairs of
renames above.
So in theory it just as simple as a one-liner to turn on break-detection
in merge-recursive. Sadly, that only reveals more issues with how
merge-recursive handles renames. See this thread, which has pointers to
the breakages at the end:
http://thread.gmane.org/gmane.comp.version-control.git/169944
I've become convinced that the best way forward with merge-recursive is
to scrap and rewrite it. It tries to do things in a muddled order, which
makes it very brittle to changes like this. I think it needs to have an
internal representation of the tree that can represent all of the
conflicts, and then follow a few simple phases:
1. "structural" 3-way merge handling renames, breaks, typechanges,
etc. Each path in tree might show things like D/F conflicts, or it
might show content-level merges that still need to happen, even if
the content from those merges is not coming from the same paths in
the source trees.
2. Resolve content-level 3-way merges at each path.
3. Compare the proposed tree to the working tree and list any problems
(e.g., untracked files or local modifications that will be
overwritten).
Right now it tries to do these things interleaved as it processes paths,
and as a result we've had many bugs (e.g., the content-level merge
conflating the content originally at a path and something that was
renamed into place, and missing corner cases where we actually overwrite
untracked files that should be considered precious).
But that is just off the top of my head. I haven't looked at the topic
in quite a while (and I haven't even started working on any such
rewrite).
> So I played locally with a few ideas, and was surprised to find out
> that even breaking up the two renames into two separate commits git
> still didn't follow it.
Right, because the merge only looks at the end points. Try doing a
"diff -M" between your endpoints with and without "-B". We do not have
any double-renames in git.git, but you can find "-B" helping a similar
case: most of a file's content is moved elsewhere, but some small amount
remains. For example, try this in git.git, with and without -B:
git show -M --stat --summary --patch 043a449
It finds the rename only with "-B", which would help a merge (it also
makes the diff shorter and more readable, as you can see what was
changed as the content migrated to the new file).
-Peff
^ permalink raw reply [flat|nested] 7+ messages in thread
* Re: Rename edge case...
2012-11-09 16:09 ` Jeff King
@ 2012-11-10 2:01 ` John Szakmeister
0 siblings, 0 replies; 7+ messages in thread
From: John Szakmeister @ 2012-11-10 2:01 UTC (permalink / raw)
To: Jeff King; +Cc: git
On Fri, Nov 9, 2012 at 11:09 AM, Jeff King <peff@peff.net> wrote:
[snip]
> Right. If the source didn't go away, it would be a copy. We can do copy
> detection, but it is not quite as obvious what a merge should do with a
> copy (apply the change to the original? To the copy? In both places? You
> would really want hunk-level copy detection for it to make any sense).
Yeah, I wasn't advocating that. More along the lines of what you're
talking about below...
> Usually git deals with this double-rename case through the use of
> "break" or "rewrite" detection. We notice that the old "foo.txt" and the
> new "foo.txt" do not look very much like each other, and break the
> modification apart into an add and a delete. That makes each side
> eligible for rename detection, and we can end up finding the pairs of
> renames above.
I did try using the -B option, and it did detect that foo.txt was
renamed to fooOld.txt, but it didn't show fooNew.txt being renamed to
foo.txt. I'm running git 1.7.12.3. It could be that 1.8.0 does
better, but I haven't tried.
> So in theory it just as simple as a one-liner to turn on break-detection
> in merge-recursive. Sadly, that only reveals more issues with how
> merge-recursive handles renames. See this thread, which has pointers to
> the breakages at the end:
>
> http://thread.gmane.org/gmane.comp.version-control.git/169944
Thank you. I'll definitely read up on this.
> I've become convinced that the best way forward with merge-recursive is
> to scrap and rewrite it. It tries to do things in a muddled order, which
> makes it very brittle to changes like this. I think it needs to have an
> internal representation of the tree that can represent all of the
> conflicts, and then follow a few simple phases:
>
> 1. "structural" 3-way merge handling renames, breaks, typechanges,
> etc. Each path in tree might show things like D/F conflicts, or it
> might show content-level merges that still need to happen, even if
> the content from those merges is not coming from the same paths in
> the source trees.
>
> 2. Resolve content-level 3-way merges at each path.
>
> 3. Compare the proposed tree to the working tree and list any problems
> (e.g., untracked files or local modifications that will be
> overwritten).
>
> Right now it tries to do these things interleaved as it processes paths,
> and as a result we've had many bugs (e.g., the content-level merge
> conflating the content originally at a path and something that was
> renamed into place, and missing corner cases where we actually overwrite
> untracked files that should be considered precious).
>
> But that is just off the top of my head. I haven't looked at the topic
> in quite a while (and I haven't even started working on any such
> rewrite).
That certainly sounds like a better approach.
>> So I played locally with a few ideas, and was surprised to find out
>> that even breaking up the two renames into two separate commits git
>> still didn't follow it.
>
> Right, because the merge only looks at the end points. Try doing a
> "diff -M" between your endpoints with and without "-B". We do not have
> any double-renames in git.git, but you can find "-B" helping a similar
> case: most of a file's content is moved elsewhere, but some small amount
> remains. For example, try this in git.git, with and without -B:
>
> git show -M --stat --summary --patch 043a449
>
> It finds the rename only with "-B", which would help a merge (it also
> makes the diff shorter and more readable, as you can see what was
> changed as the content migrated to the new file).
I've played with the -B option before, and it's definitely nice in
certain cases.
Thank you for taking the time to write all this up. It was very informative!
-John
^ permalink raw reply [flat|nested] 7+ messages in thread
end of thread, other threads:[~2012-11-10 2:01 UTC | newest]
Thread overview: 7+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2012-11-09 9:10 Rename edge case John Szakmeister
2012-11-09 9:27 ` Tomas Carnecky
2012-11-09 10:25 ` John Szakmeister
2012-11-09 10:30 ` Nguyen Thai Ngoc Duy
2012-11-09 13:17 ` Johannes Sixt
2012-11-09 16:09 ` Jeff King
2012-11-10 2:01 ` John Szakmeister
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).