From: Yann Droneaud <yann@droneaud.fr>
To: git <git@vger.kernel.org>
Subject: Re: [PATCH] t9129: fix UTF-8 locale detection
Date: Mon, 24 May 2010 19:08:08 +0200 [thread overview]
Message-ID: <1274720888.4838.13.camel@localhost> (raw)
In-Reply-To: <1274202486.4228.22.camel@localhost>
Le mardi 18 mai 2010 à 19:08 +0200, Yann Droneaud a écrit :
> Le mardi 18 mai 2010 à 18:05 +0200, Michael J Gruber a écrit :
> > Yann Droneaud venit, vidit, dixit 18.05.2010 16:41:
> > > Since I don't have en_US.utf8, some tests failed:
>
> > >
> > > On my system locale -a reports:
> > >
> > > en_US
> > > en_US.ISO-8859-1
> > > en_US.UTF-8
> > >
> >
> > locale -a|grep en_US
> > en_US
> > en_US.iso88591
> > en_US.iso885915
> > en_US.utf8
> >
> > This is on Fedora 13, which is not exactly exotic. What is your system?
> >
>
I've checked carefully multiple system and configuration, and found why
we have some little locale problem here.
Since glibc 2.3, a file can hold all locales in file "locale-archive"
instead of having a tons of directory. To store all the locales in this
file, it uses an index based on a "normalized" codeset, e.g. it converts
codeset to lowercase, removes dash and minus.
So when one ask for the locale list, locale first go through the
"locale-archive" content and report normalized codeset (utf8) instead
of canonical codeset (UTF-8), then it proceed with the legacy locales
directories, using for them the canonical codeset.
Until recently, Mandriva Linux doesn't make use of "locale-archive", so
UTF-8 locales were reported. Version in development uses
"locale-archive" + legacy locale directories, hence the mix I've
reported. Other Linux distributions like Fedora and Ubuntu uses only
"locale-archive" and so, have only "normalized" codeset.
POSIX doesn't specify the output of locale -a, so it's not really a bug
to show "normalized" codeset name.
But all others "POSIX" system I've found report "canonical" codeset,
e.g. UTF-8 (all but latest cygwin).
Here's the bug report:
http://sourceware.org/bugzilla/show_bug.cgi?id=11629
BTW, I will shortly provided a fix for the testcase, which will handle
all cases.
Regards.
--
Yann Droneaud
next prev parent reply other threads:[~2010-05-24 17:08 UTC|newest]
Thread overview: 23+ messages / expand[flat|nested] mbox.gz Atom feed top
2010-05-18 14:41 [PATCH] t9129: fix UTF-8 locale detection Yann Droneaud
2010-05-18 16:05 ` Michael J Gruber
2010-05-18 17:08 ` Yann Droneaud
2010-05-18 17:16 ` Yann Droneaud
2010-05-18 17:45 ` Linus Torvalds
2010-05-18 19:58 ` Andreas Schwab
2010-05-18 20:00 ` Linus Torvalds
2010-05-18 20:49 ` Yann Droneaud
2010-05-19 15:16 ` Yann Droneaud
2010-06-02 19:14 ` Re* " Junio C Hamano
2010-05-24 17:08 ` Yann Droneaud [this message]
2010-05-25 7:33 ` Michael J Gruber
2011-01-06 14:22 ` [PATCH/RFC 0/4] en_US.UTF-8 " Yann Droneaud
2011-01-06 14:22 ` [PATCH 1/4] test: add a library to detect an en_US.UTF-8 locale Yann Droneaud
2011-01-07 18:37 ` Junio C Hamano
2011-01-06 14:22 ` [PATCH 2/4] test-lib.sh: add test_utf8() function Yann Droneaud
2011-01-07 18:45 ` Junio C Hamano
2011-01-06 14:22 ` [PATCH 3/4] test: use test_utf8 and GIT_LC_UTF8 where an en_US.UTF-8 locale is required Yann Droneaud
2011-01-07 18:48 ` Junio C Hamano
2011-01-06 14:22 ` [PATCH 4/4] t9129: use "$PERL_PATH" instead of "perl" Yann Droneaud
2011-01-07 18:49 ` Junio C Hamano
2010-05-18 20:37 ` [PATCH] t9129: fix UTF-8 locale detection Yann Droneaud
2010-05-19 2:44 ` Miles Bader
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=1274720888.4838.13.camel@localhost \
--to=yann@droneaud.fr \
--cc=git@vger.kernel.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).