* git apply: git diff header lacks filename information for git diff --no-index patch @ 2008-10-02 18:27 Imre Deak 2008-10-04 4:17 ` Jeff King 0 siblings, 1 reply; 10+ messages in thread From: Imre Deak @ 2008-10-02 18:27 UTC (permalink / raw) To: git Hi, I have the following problem: $ echo -e '\x0' > a $ git diff --no-index --binary /dev/null a > patch $ rm a $ git apply patch fatal: git diff header lacks filename information (line 4) $ cat patch diff --git a/dev/null b/a new file mode 100644 index 0000000000000000000000000000000000000000..1f2a4f5ef3df7f7456d91c961da36fc58904f2f1 GIT binary patch literal 2 JcmZSJ0ssIE01E&B literal 0 HcmV?d00001 The same works for text based patches: $ echo 1 > a $ git diff --no-index /dev/null a > patch $ rm a $ git apply patch $ ls a $ cat patch diff --git a/dev/null b/a new file mode 100644 index 0000000..d00491f --- /dev/null +++ b/a @@ -0,0 +1 @@ +1 The binary patch lacks ---/+++ lines but still provides the name info on the diff --git line which I think should suffice for git apply. --Imre ^ permalink raw reply related [flat|nested] 10+ messages in thread
* Re: git apply: git diff header lacks filename information for git diff --no-index patch 2008-10-02 18:27 git apply: git diff header lacks filename information for git diff --no-index patch Imre Deak @ 2008-10-04 4:17 ` Jeff King 2008-10-04 8:28 ` Jakub Narebski 2008-10-04 16:54 ` Linus Torvalds 0 siblings, 2 replies; 10+ messages in thread From: Jeff King @ 2008-10-04 4:17 UTC (permalink / raw) To: Imre Deak; +Cc: Linus Torvalds, git On Thu, Oct 02, 2008 at 09:27:36PM +0300, Imre Deak wrote: > $ git apply patch > fatal: git diff header lacks filename information (line 4) > $ cat patch > diff --git a/dev/null b/a > new file mode 100644 > index 0000000000000000000000000000000000000000..1f2a4f5ef3df7f7456d91c961da36fc58904f2f1 > GIT binary patch Hmm. The problem is that "git apply" doesn't accept that "a/dev/null" and "b/a" are the same, so it rejects them as a name. I guess on a text patch, we would just pull that information from the "---" and "+++" lines, so we don't care that it's not on the diff commandline. However, a _non_ --no-index patch doesn't produce the same output. It will actually produce the line: diff --git a/a b/a even if it is a creation patch. So I'm not sure which piece of code is at fault. Either: 1. git apply is right to reject, and "git diff --no-index" should be putting the actual filename on the commandline of a binary patch instead of /dev/null, even if it is a creation patch. or 2. git apply should accept this construct. Perhaps we should relax the "both names must be the same" rule if one of the names is /dev/null (and we would take the other)? Linus, the "both names must be the same" code in git_header_name blames to you (5041aa70). Thoughts on number 2? -Peff ^ permalink raw reply [flat|nested] 10+ messages in thread
* Re: git apply: git diff header lacks filename information for git diff --no-index patch 2008-10-04 4:17 ` Jeff King @ 2008-10-04 8:28 ` Jakub Narebski 2008-10-05 19:19 ` Jeff King 2008-10-04 16:54 ` Linus Torvalds 1 sibling, 1 reply; 10+ messages in thread From: Jakub Narebski @ 2008-10-04 8:28 UTC (permalink / raw) To: git [Cc: git@vger.kernel.org, Jeff King <peff@peff.net>, Imre Deak <imre.deak@gmail.com>] Jeff King wrote: > On Thu, Oct 02, 2008 at 09:27:36PM +0300, Imre Deak wrote: > >> $ git apply patch >> fatal: git diff header lacks filename information (line 4) >> $ cat patch >> diff --git a/dev/null b/a >> new file mode 100644 >> index 0000000000000000000000000000000000000000..1f2a4f5ef3df7f7456d91c961da36fc58904f2f1 >> GIT binary patch > > Hmm. The problem is that "git apply" doesn't accept that "a/dev/null" > and "b/a" are the same, so it rejects them as a name. I Shouldn't it be "/dev/null", not "a/dev/null"? Besides git-diff(1) states: 1. It is preceded with a "git diff" header, that looks like this: diff --git a/file1 b/file2 The `a/` and `b/` filenames are the same unless rename/copy is involved. Especially, even for a creation or a deletion, `/dev/null` is _not_ used in place of `a/` or `b/` filenames. Looks like a bug in patch generation code... -- Jakub Narebski Warsaw, Poland ShadeHawk on #git ^ permalink raw reply [flat|nested] 10+ messages in thread
* Re: git apply: git diff header lacks filename information for git diff --no-index patch 2008-10-04 8:28 ` Jakub Narebski @ 2008-10-05 19:19 ` Jeff King 0 siblings, 0 replies; 10+ messages in thread From: Jeff King @ 2008-10-05 19:19 UTC (permalink / raw) To: Jakub Narebski; +Cc: Imre Deak, git [argh, resending with the list cc'd] On Sat, Oct 04, 2008 at 10:28:19AM +0200, Jakub Narebski wrote: > > Hmm. The problem is that "git apply" doesn't accept that "a/dev/null" > > and "b/a" are the same, so it rejects them as a name. I > > Shouldn't it be "/dev/null", not "a/dev/null"? Yes, see my recent reply to Linus elsewhere in the thread for why I didn't think it was an issue at the time (but it clearly is). > Besides git-diff(1) states: > > 1. It is preceded with a "git diff" header, that looks like > this: > > diff --git a/file1 b/file2 > > The `a/` and `b/` filenames are the same unless rename/copy is > involved. Especially, even for a creation or a deletion, > `/dev/null` is _not_ used in place of `a/` or `b/` filenames. And I hadn't seen this, which makes it utterly clear that the diff is broken. Thanks for pointing it out. -Peff ^ permalink raw reply [flat|nested] 10+ messages in thread
* Re: git apply: git diff header lacks filename information for git diff --no-index patch 2008-10-04 4:17 ` Jeff King 2008-10-04 8:28 ` Jakub Narebski @ 2008-10-04 16:54 ` Linus Torvalds 2008-10-04 17:08 ` Linus Torvalds 2008-10-05 19:17 ` Jeff King 1 sibling, 2 replies; 10+ messages in thread From: Linus Torvalds @ 2008-10-04 16:54 UTC (permalink / raw) To: Jeff King; +Cc: Imre Deak, git On Sat, 4 Oct 2008, Jeff King wrote: > On Thu, Oct 02, 2008 at 09:27:36PM +0300, Imre Deak wrote: > > $ git apply patch > > fatal: git diff header lacks filename information (line 4) > > $ cat patch > > diff --git a/dev/null b/a > > new file mode 100644 > > index 0000000000000000000000000000000000000000..1f2a4f5ef3df7f7456d91c961da36fc58904f2f1 > > GIT binary patch Ok, this patch is broken in so many ways.. > Hmm. The problem is that "git apply" doesn't accept that "a/dev/null" > and "b/a" are the same, so it rejects them as a name. I guess on a text > patch, we would just pull that information from the "---" and "+++" > lines, so we don't care that it's not on the diff commandline. Exactly. In order to avoid all the ambiguities, we want the filename to match on the 'diff -' line to even be able to guess, and if it doesn't, we should pick it up from the "rename from" lines (for a git diff), or from the '--- a/filename'/'+++ b/filename' otherwise (if it's not a rename, or not a git diff). And being a binary diff, and a creation event, all of this fails. To make things worse, git has also screwed up the "/dev/null" and prepended the prefix to it, making it even harder to see any patterns to the names. Gaah. > However, a _non_ --no-index patch doesn't produce the same output. It > will actually produce the line: > > diff --git a/a b/a > > even if it is a creation patch. So I'm not sure which piece of code is > at fault. It's some of both. I think --no-index is broken, that "a/dev/null" is total crap regardless of anything else. There's no way that thing is correct, and even if you were not to want "a/a", it should have been just plain "/dev/null". But for a native git patch, we shouldn't even use the (at least slightly more correct) "/dev/null", since native git application doesn't actually do the "is_dev_null()" for native patches, and just wants the filename to be the same. So I think "git apply" is correct, and "git diff --no-index" is broken. That said, I think git-apply being "correct" is not a great excuse, and we should do the "be liberal in what you accept, conservative in what you generate", and think about how to make git-apply be more willing to handle this case too. Quite frankly, I should have doen the explicit headers as "new file " <mode> SP <name> instead of "new file mode " <mode> (and same for "deleted file", obviously) and the patch would have been more readable _and_ we'd have avoided this issue. But when I did all that, I didn't even think of binary diffs (they weren't an issue originally). Dang. Anyway, for now I'd suggest a patch like this. Hmm? The "diff --git" format assumes that the names are the same, so make it so. Linus --- diff.c | 5 +++-- 1 files changed, 3 insertions(+), 2 deletions(-) diff --git a/diff.c b/diff.c index 9053d4d..0910b4e 100644 --- a/diff.c +++ b/diff.c @@ -1479,7 +1479,7 @@ static void builtin_diff(const char *name_a, { mmfile_t mf1, mf2; const char *lbl[2]; - char *a_one, *b_two; + char *a_one, *b_two, *h_name; const char *set = diff_get_color_opt(o, DIFF_METAINFO); const char *reset = diff_get_color_opt(o, DIFF_RESET); const char *a_prefix, *b_prefix; @@ -1497,7 +1497,8 @@ static void builtin_diff(const char *name_a, b_two = quote_two(b_prefix, name_b + (*name_b == '/')); lbl[0] = DIFF_FILE_VALID(one) ? a_one : "/dev/null"; lbl[1] = DIFF_FILE_VALID(two) ? b_two : "/dev/null"; - fprintf(o->file, "%sdiff --git %s %s%s\n", set, a_one, b_two, reset); + h_name = DIFF_FILE_VALID(one) ? a_one : b_two; + fprintf(o->file, "%sdiff --git %s %s%s\n", set, h_name, h_name, reset); if (lbl[0][0] == '/') { /* /dev/null */ fprintf(o->file, "%snew file mode %06o%s\n", set, two->mode, reset); ^ permalink raw reply related [flat|nested] 10+ messages in thread
* Re: git apply: git diff header lacks filename information for git diff --no-index patch 2008-10-04 16:54 ` Linus Torvalds @ 2008-10-04 17:08 ` Linus Torvalds 2008-10-04 17:48 ` Linus Torvalds 2008-10-05 19:17 ` Jeff King 1 sibling, 1 reply; 10+ messages in thread From: Linus Torvalds @ 2008-10-04 17:08 UTC (permalink / raw) To: Jeff King; +Cc: Imre Deak, git On Sat, 4 Oct 2008, Linus Torvalds wrote: > > Anyway, for now I'd suggest a patch like this. Hmm? The "diff --git" > format assumes that the names are the same, so make it so. Ahh, never mind, this one is broken. It's not _horribly_ so, but I shouldhave checked the test breakage. It changes non-broken cases from diff --git a/file b/file into diff --git a/file a/file so the header thing would need to be done differently. Don't even bother with that broken patch. I'll think about doing it better. Linus ^ permalink raw reply [flat|nested] 10+ messages in thread
* Re: git apply: git diff header lacks filename information for git diff --no-index patch 2008-10-04 17:08 ` Linus Torvalds @ 2008-10-04 17:48 ` Linus Torvalds 2008-10-05 19:21 ` Jeff King 0 siblings, 1 reply; 10+ messages in thread From: Linus Torvalds @ 2008-10-04 17:48 UTC (permalink / raw) To: Jeff King; +Cc: Imre Deak, git On Sat, 4 Oct 2008, Linus Torvalds wrote: > > Ahh, never mind, this one is broken. Ok, so here's a much better version, I think. It just refuses to use an invalid name for anything visible in the diff. Now, this fixes the "git diff --no-index --binary a /dev/null" kind of case (and we'll end up using "a" as the basename), but some other insane cases are impossible to handle. If you do git diff --no-index --binary a /bin/echo you'll still get a patch like diff --git a/a b/bin/echo old mode 100644 new mode 100755 index ... and "git apply" will refuse to apply it for a couple of reasons, and the diff is simply bogus. And that, btw, is no longer a bug, I think. It's impossible to know whethe the user meant for the patch to be a rename or not. And as such, refusing to apply it because you don't know what name you should use is probably _exactly_ the right thing to do! Linus ---- diff.c | 4 ++++ 1 files changed, 4 insertions(+), 0 deletions(-) diff --git a/diff.c b/diff.c index 9053d4d..197533e 100644 --- a/diff.c +++ b/diff.c @@ -1493,6 +1493,10 @@ static void builtin_diff(const char *name_a, b_prefix = o->b_prefix; } + /* Never use a non-valid filename anywhere if at all possible */ + name_a = DIFF_FILE_VALID(one) ? name_a : name_b; + name_b = DIFF_FILE_VALID(two) ? name_b : name_a; + a_one = quote_two(a_prefix, name_a + (*name_a == '/')); b_two = quote_two(b_prefix, name_b + (*name_b == '/')); lbl[0] = DIFF_FILE_VALID(one) ? a_one : "/dev/null"; ^ permalink raw reply related [flat|nested] 10+ messages in thread
* Re: git apply: git diff header lacks filename information for git diff --no-index patch 2008-10-04 17:48 ` Linus Torvalds @ 2008-10-05 19:21 ` Jeff King 0 siblings, 0 replies; 10+ messages in thread From: Jeff King @ 2008-10-05 19:21 UTC (permalink / raw) To: Linus Torvalds; +Cc: Imre Deak, git On Sat, Oct 04, 2008 at 10:48:08AM -0700, Linus Torvalds wrote: > Ok, so here's a much better version, I think. Yes, this approach makes sense to me after reading your other explanation as well as Jakub's pointer to git-diff(1). And it passes my test. ;) I'll squash in my test case, add a little context to the commit message, and send it to Shawn. -Peff ^ permalink raw reply [flat|nested] 10+ messages in thread
* Re: git apply: git diff header lacks filename information for git diff --no-index patch 2008-10-04 16:54 ` Linus Torvalds 2008-10-04 17:08 ` Linus Torvalds @ 2008-10-05 19:17 ` Jeff King 2008-10-05 19:24 ` Jeff King 1 sibling, 1 reply; 10+ messages in thread From: Jeff King @ 2008-10-05 19:17 UTC (permalink / raw) To: Linus Torvalds; +Cc: Imre Deak, git On Sat, Oct 04, 2008 at 09:54:36AM -0700, Linus Torvalds wrote: > Exactly. In order to avoid all the ambiguities, we want the filename to > match on the 'diff -' line to even be able to guess, and if it doesn't, we > should pick it up from the "rename from" lines (for a git diff), or from > the '--- a/filename'/'+++ b/filename' otherwise (if it's not a rename, or > not a git diff). > > And being a binary diff, and a creation event, all of this fails. I wonder if it might have been better for binary diffs, like text diffs, to contain the "from" and "to" filenames in a similar format. But at this point I don't think a format change is really worthwhile. > To make things worse, git has also screwed up the "/dev/null" and > prepended the prefix to it, making it even harder to see any patterns to > the names. Gaah. Yes, I noticed that, as well. And obviously it looks bogus, but I thought I managed to get "git diff" to produce "a/dev/null" on some otherwise valid input, and so assumed that was something we were able to work around in applying the patch. But testing again today, I can't seem to get anything except this broken diff to say "a/dev/null". So probably I was just mistaken yesterday. > So I think "git apply" is correct, and "git diff --no-index" is broken. OK, your reasoning is sound. > That said, I think git-apply being "correct" is not a great excuse, and we > should do the "be liberal in what you accept, conservative in what you > generate", and think about how to make git-apply be more willing to handle > this case too. I think for now we might as well just fix the broken "diff" output. The only thing generating these broken diffs will be older versions of git, and such diffs are presumably rare (given that this is the first report we've seen). So the only advantage would be to accept rare patches from people with older git, versus asking them to upgrade to a non-broken git. > Quite frankly, I should have doen the explicit headers as > > "new file " <mode> SP <name> > > instead of > > "new file mode " <mode> > > (and same for "deleted file", obviously) and the patch would have been > more readable _and_ we'd have avoided this issue. But when I did all that, > I didn't even think of binary diffs (they weren't an issue originally). Agreed (and I think this is just another form of what I mentioned above; in my suggestion, we would include the filename on _all_ binary diffs. In practice, it wouldn't matter in non-creation cases, since we actually get the "diff --git" line _right_ in those cases :) ). But again, I don't think it is worth trying to change the format now. -Peff ^ permalink raw reply [flat|nested] 10+ messages in thread
* Re: git apply: git diff header lacks filename information for git diff --no-index patch 2008-10-05 19:17 ` Jeff King @ 2008-10-05 19:24 ` Jeff King 0 siblings, 0 replies; 10+ messages in thread From: Jeff King @ 2008-10-05 19:24 UTC (permalink / raw) To: Linus Torvalds; +Cc: Imre Deak, git On Sun, Oct 05, 2008 at 03:17:28PM -0400, Jeff King wrote: > Yes, I noticed that, as well. And obviously it looks bogus, but I > thought I managed to get "git diff" to produce "a/dev/null" on some > otherwise valid input, and so assumed that was something we were able to > work around in applying the patch. But testing again today, I can't seem > to get anything except this broken diff to say "a/dev/null". So probably > I was just mistaken yesterday. Ah, I found it. It was the fact that _text_ diffs using --no-index also generate that bogosity. But of course they work fine, since we just reject the "default" name from the "diff --git" line, but end up getting the names from the "---" and "+++" lines. -Peff ^ permalink raw reply [flat|nested] 10+ messages in thread
end of thread, other threads:[~2008-10-05 19:25 UTC | newest] Thread overview: 10+ messages (download: mbox.gz follow: Atom feed -- links below jump to the message on this page -- 2008-10-02 18:27 git apply: git diff header lacks filename information for git diff --no-index patch Imre Deak 2008-10-04 4:17 ` Jeff King 2008-10-04 8:28 ` Jakub Narebski 2008-10-05 19:19 ` Jeff King 2008-10-04 16:54 ` Linus Torvalds 2008-10-04 17:08 ` Linus Torvalds 2008-10-04 17:48 ` Linus Torvalds 2008-10-05 19:21 ` Jeff King 2008-10-05 19:17 ` Jeff King 2008-10-05 19:24 ` Jeff King
This is a public inbox, see mirroring instructions for how to clone and mirror all data and code used for this inbox; as well as URLs for NNTP newsgroup(s).