git.vger.kernel.org archive mirror
* [JGIT PATCH 0/4] Decodings
@ 2008-10-12 22:50 Robin Rosenberg
  2008-10-12 22:50 ` [JGIT PATCH 1/4] log command: Use explicit US locale for dates Robin Rosenberg
  0 siblings, 1 reply; 8+ messages in thread
From: Robin Rosenberg @ 2008-10-12 22:50 UTC
  To: spearce; +Cc: git

These patches might be slightly controversial. Since there is no perfect
solution, we may want to try something that works to some extent and gives
what a user might expect, i.e. presenting to a viewer the same glyphs 
that the user who entered them saw, to the extent it is possible.

We already handle commit messages like this for the old style objects (sort of).
This patch set also affects other data like refs. Currently no sane solution
exists in git, so nothing really works well outside the ASCII range for
refs anyway, and we can discuss what should happen with refs that contain
non-ASCII characters. The best thing is to avoid them, but some of us live
in countries with funny dots in what we write, and others have even stranger
ways of expressing themselves, and that carries over to things like branch
names etc.

Legacy SCM to GIT conversion programs seem to do every variation of transcoding
or not transcoding commit messages and file names to UTF-8, so there is an issue
here. The nice thing about transcoding filenames to UTF-8 is that they work on
all platforms. A non-UTF-8 filename in a UTF-8 environment doesn't. In
particular, such filenames are more or less inaccessible to Java programs. For
the reverse case it looks really bad. C Git currently does not transform file
names. Test cases are missing from this patch set. Since what happens is quite
undefined in git, that is sort of ok so far, but I'd like to define it too in
the same way.
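
To make the decoding idea concrete, here is a minimal sketch of one possible
strategy: try strict UTF-8 first, and fall back to a legacy charset only when
the bytes are not valid UTF-8. This is not the code from this patch set; the
class name FallbackDecoder, the method decode and the use of ISO-8859-1 as
the fallback are assumptions made for illustration.

```java
import java.nio.ByteBuffer;
import java.nio.charset.CharacterCodingException;
import java.nio.charset.Charset;
import java.nio.charset.CharsetDecoder;
import java.nio.charset.CodingErrorAction;
import java.nio.charset.StandardCharsets;

/** Hypothetical helper, not part of JGit: decodes bytes of unknown encoding. */
public final class FallbackDecoder {
    private FallbackDecoder() {
    }

    /**
     * Decode raw bytes as UTF-8 if they are well-formed UTF-8,
     * otherwise decode them with the given fallback charset.
     */
    public static String decode(byte[] raw, Charset fallback) {
        // REPORT makes the decoder throw instead of silently
        // substituting U+FFFD for malformed input.
        CharsetDecoder strict = StandardCharsets.UTF_8.newDecoder()
                .onMalformedInput(CodingErrorAction.REPORT)
                .onUnmappableCharacter(CodingErrorAction.REPORT);
        try {
            return strict.decode(ByteBuffer.wrap(raw)).toString();
        } catch (CharacterCodingException notUtf8) {
            // Assumption: the data was written in a legacy 8-bit charset.
            return new String(raw, fallback);
        }
    }
}
```

With this sketch, the two-byte sequence C3 A5 (UTF-8) and the single byte E5
(ISO-8859-1) both decode to the same character, U+00E5. It is only a
heuristic: a legacy byte sequence that happens to be valid UTF-8 is taken as
UTF-8.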

-- robin


Thread overview: 8+ messages
2008-10-12 22:50 [JGIT PATCH 0/4] Decodings Robin Rosenberg
2008-10-12 22:50 ` [JGIT PATCH 1/4] log command: Use explicit US locale for dates Robin Rosenberg
2008-10-12 22:50   ` [JGIT PATCH 2/4] jgit programs: Use i18n.logOutputEncoding or user's locale for output Robin Rosenberg
2008-10-12 22:50     ` [JGIT PATCH 3/4] The git config file is case insensitive Robin Rosenberg
2008-10-12 22:51       ` [JGIT PATCH 4/4] Intelligent parsing of ambiguously encoded meta data Robin Rosenberg
2008-10-13  2:27         ` Shawn O. Pearce
2008-10-13 17:10           ` Robin Rosenberg
2008-10-13  2:36       ` [JGIT PATCH 3/4] The git config file is case insensitive Shawn O. Pearce
