git.vger.kernel.org archive mirror
 help / color / mirror / Atom feed
* Git diff misattributes the first word of a line to the previous line
@ 2022-10-13  5:51 Gurjeet Singh
  2022-10-13  6:45 ` Johannes Sixt
  2022-10-13 12:16 ` Bagas Sanjaya
  0 siblings, 2 replies; 4+ messages in thread
From: Gurjeet Singh @ 2022-10-13  5:51 UTC (permalink / raw)
  To: git

Git diff seems to get confused about word boundaries, and includes the
first word from the next line.

In the output below, you'd be forgiven to assume that the word 'ab'
was removed from first line, and was replaced with the word 'opt1'.
But as you can see in the contents of file '1.txt', that word was on
the _second_ line.

It seems that the first word of a line gets attributed to the previous
line, ignoring the fact that there's an intervening newline before the
word.

This confusion is exhibited by the --color-words option, too, where I
discovered it first. But when trying to create a reproducible test
case, I discovered that this may be a problem with the word-boundary
identification, and the --word-diff=plain is a better way to
demonstrate the bug.

I have also eliminated the possibility that this may be due to some
misconfiguration in my GIt config files, by setting some environment
variables, as seen in the last command.

$ git diff --word-diff=plain /tmp/1.txt /tmp/2.txt
diff --git a/tmp/1.txt b/tmp/2.txt
index 8239f93..099fb80 100644
--- a/tmp/1.txt
+++ b/tmp/2.txt
@@ -1,2 +1,2 @@
    x = yz [-ab-]{+opt1+}
{+    ac+} = [-cd ef-]{+pq opt2+}

$ cat /tmp/1.txt
    x = yz
    ab = cd ef

$ cat /tmp/2.txt
    x = yz opt1
    ac = pq opt2

# Git installed on macOS, via Nixpkgs
$ git --version
git version 2.35.1

# Also tested on Git installed via Homebrew
$ /usr/local/bin/git --version
git version 2.38.0

# Try to run with a clean environment.
$ export GIT_CONFIG_GLOBAL=/dev/null
$ export GIT_CONFIG_SYSTEM=/dev/null
$ export GIT_CONFIG_NOSYSTEM=yes
$ git diff --word-diff=plain /tmp/1.txt /tmp/2.txt # Same buggy output

# **** Expected **** output
$ git diff --word-diff=plain /tmp/1.txt /tmp/2.txt
diff --git a/tmp/1.txt b/tmp/2.txt
index 8239f93..099fb80 100644
--- a/tmp/1.txt
+++ b/tmp/2.txt
@@ -1,2 +1,2 @@
    x = yz {+opt1+}
    [-ab-]{+ac+} = [-cd ef-]{+pq opt2+}

Best regards,
Gurjeet
http://Gurje.et

^ permalink raw reply related	[flat|nested] 4+ messages in thread

end of thread, other threads:[~2022-10-13 12:16 UTC | newest]

Thread overview: 4+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2022-10-13  5:51 Git diff misattributes the first word of a line to the previous line Gurjeet Singh
2022-10-13  6:45 ` Johannes Sixt
2022-10-13 11:30   ` Philip Oakley
2022-10-13 12:16 ` Bagas Sanjaya

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).