From: "Randall S. Becker" <rsbecker@nexbridge.com>
To: "'Michal Suchánek'" <msuchanek@suse.de>
Cc: <git@vger.kernel.org>
Subject: RE: Need multibyte advice - Shift-JIS
Date: Wed, 27 Feb 2019 12:38:06 -0500
Message-ID: <001b01d4cec3$364149e0$a2c3dda0$@nexbridge.com>
In-Reply-To: <20190227181458.4f019d91@kitsune.suse.cz>
On February 27, 2019 12:15, Michal Suchánek wrote:
> To: Randall S. Becker <rsbecker@nexbridge.com>
> Cc: git@vger.kernel.org
> Subject: Re: Need multibyte advice - Shift-JIS
>
> On Wed, 27 Feb 2019 12:03:58 -0500
> "Randall S. Becker" <rsbecker@nexbridge.com> wrote:
>
> > On February 27, 2019 11:52, Michal Suchánek wrote:
> > > To: Randall S. Becker <rsbecker@nexbridge.com>
> > > Cc: git@vger.kernel.org
> > > Subject: Re: Need multibyte advice - Shift-JIS
> > >
> > > On Wed, 27 Feb 2019 11:33:47 -0500
> > > "Randall S. Becker" <rsbecker@nexbridge.com> wrote:
> > >
> > > > On February 27, 2019 11:29 Michal Suchánek wrote:
> > > > > On Wed, 27 Feb 2019 11:19:33 -0500 "Randall S. Becker"
> > > > > <rsbecker@nexbridge.com> wrote:
> > > > >
> > > > > > On February 27, 2019 11:11, Michal Suchánek wrote:
> > > > > > > On Wed, 27 Feb 2019 10:54:23 -0500 "Randall S. Becker"
> > > > > > > <rsbecker@nexbridge.com> wrote:
> > > > > > >
> > > > > > > > On February 27, 2019 9:09, Michal Suchánek wrote:
> > > > > > > > > On Wed, 27 Feb 2019 08:04:08 -0500 "Randall S. Becker"
> > > > > > > > > <rsbecker@nexbridge.com> wrote:
> > > > > > > > >
> > > > > > > > > > Hi Git Team,
> > > > > > > > > >
> > > > > > > > > > > I have to admit being perplexed by this one. I have
> > > > > > > > > > > been asked to support the Shift-JIS character set in
> > > > > > > > > > > file contents, comments, and logs, for a partner of
> > > > > > > > > > > mine. I know there are a few ways to do this, but I'm
> > > > > > > > > > > looking for the official non-hacky way to do this.
> > > > > > > > > > > This is CLI only, and our pager, less, does not
> > > > > > > > > > > support multi-byte, so I'm looking for options there
> > > > > > > > > > > also.
> > > > > > > > >
> > > > > > > > > SJIS is about as much multibyte as UTF-8.
> > > > > > > > >
> > > > > > > > > Why do you think less does not support it?
> > > > > > > > >
> > > > > > > > > > Last time I looked there was an SJIS locale for libc, so
> > > > > > > > > > it is only a matter of generating the correct locales and
> > > > > > > > > > using them. Of course, if you are running in UTF-8,
> > > > > > > > > > SJIS will look like garbage.
> > > > > > > >
> > > > > > > > > Sadly, I did not personally build less on this platform,
> > > > > > > > > and the libc used did not include UTF-16, on the platform
> > > > > > > > > vendor supplied less. cat works fine, but the usual
> > > > > > > > > LESSCHARSET=utf-16 is unsupported, so I am looking for an
> > > > > > > > > alternative. THAT is why I think less does not support it.
> > > > > > > > > Sorry, I should have made that more clear.
> > > > > > > > >
> > > > > > > > > cat works fine, so if I set GIT_PAGER=cat, I can at least
> > > > > > > > > see the diffs cleanly in SJIS, but this partner wants a
> > > > > > > > > pager that is usable.
> > > > > > > >
> > > > > > >
> > > > > > > So you want to use SJIS because UTF-16 is not supported. So
> > > > > > > what is the problem with SJIS (or UTF-8 for that matter)?
> > > > > >
> > > > > > > The partner I am working with is using multi-byte SJIS, which
> > > > > > > is also not supported by this incarnation of less. As a result,
> > > > > > > UTF-8 does not work either in this situation. The content is
> > > > > > > definitely multi-byte. I know this was fixed in RedHat's Less
> > > > > > > in 2016, but did not make this platform.
> > > > > >
> > > > >
> > > > > > Both UTF-8 and SJIS are multibyte and both are supported by less
> > > > > > in general. If your particular less cannot support it then it is
> > > > > > broken and you should fix it or get it fixed.
> > > >
> > > > > To be more specific, the implementation of less' UTF-8 on this
> > > > > platform will present the data as unusable junk as expected. SJIS
> > > > > is multi-byte, but is not one of the allowed encodings in less. I
> > > > > am not empowered to "get it fixed". Thanks for your advice.
> > > >
> > >
> > > How is this 'allowed encodings in less' defined?
> >
> > > When you run less with LESSCHARSET=encoding, if the encoding is not
> > > known, you get the error:
> > > invalid charset name
> > >
> > > Doing the due diligence, I actually read the man page on the platform
> > > before asking the question, which listed the following as the only
> > > allowed encodings: ascii, iso8859, latin1, latin9, dos, IBM-1047,
> > > koi8-r, next, utf-8, windows. The utf-8 variant does not know how to
> > > display its multi-byte forms in SJIS, as with other platforms. Does
> > > that make sense now?
> >
>
> Does the said man page also mention LESSCHARDEF or LESSOPEN?
Of course it does.
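
A minimal sketch of the encoding point raised earlier in the thread ("SJIS
is about as much multibyte as UTF-8"), for anyone reading along. It only
uses Python's standard codecs; the sample string and the transcode() helper
are assumptions made for illustration and do not come from the platform
under discussion.

```python
# Sketch only: Shift-JIS and UTF-8 are both variable-width encodings.
# SAMPLE is a hypothetical string: "git " followed by katakana RO and GU.
SAMPLE = "git \u30ed\u30b0"


def transcode(data: bytes, src: str = "shift_jis", dst: str = "utf-8") -> bytes:
    """Decode bytes from src and re-encode them as dst.

    Bytes that cannot be decoded are replaced with U+FFFD instead of
    raising, which is a reasonable default for display purposes.
    """
    return data.decode(src, errors="replace").encode(dst)


sjis = SAMPLE.encode("shift_jis")  # ASCII stays 1 byte, each kana takes 2
utf8 = transcode(sjis)             # ASCII stays 1 byte, each kana takes 3

# Character count, Shift-JIS byte count, UTF-8 byte count.
print(len(SAMPLE), len(sjis), len(utf8))
```

A filter built around something like transcode() could in principle sit in
front of the pager through GIT_PAGER, but that only helps if the local less
displays UTF-8 correctly, which this thread suggests is not a given here.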
Thread overview: 18+ messages
2019-02-27 13:04 Need multibyte advice - Shift-JIS Randall S. Becker
2019-02-27 14:08 ` Michal Suchánek
2019-02-27 15:54 ` Randall S. Becker
2019-02-27 16:11 ` Michal Suchánek
2019-02-27 16:19 ` Randall S. Becker
2019-02-27 16:28 ` Michal Suchánek
2019-02-27 16:33 ` Randall S. Becker
2019-02-27 16:51 ` Michal Suchánek
2019-02-27 17:03 ` Randall S. Becker
2019-02-27 17:14 ` Michal Suchánek
2019-02-27 17:38 ` Randall S. Becker [this message]
2019-02-27 17:50 ` Michal Suchánek
2019-02-27 17:59 ` Randall S. Becker
2019-02-27 18:18 ` Michal Suchánek
2019-02-27 18:50 ` Randall S. Becker
2019-02-27 18:59 ` Michal Suchánek
2019-02-27 19:36 ` Johannes Sixt
2019-02-27 19:53 ` Randall S. Becker