From: dwmw2@infradead.org (David Woodhouse)
To: linux-arm-kernel@lists.infradead.org
Subject: Sending UTF-8 patches (was: [PATCH 2/2] Remove now-defunct ts7250 nand driver)
Date: Wed, 06 Jan 2010 23:43:00 +0000 [thread overview]
Message-ID: <1262821380.3181.8838.camel@macbook.infradead.org> (raw)
In-Reply-To: <20100106232128.GE24250@shareable.org>
On Wed, 2010-01-06 at 23:21 +0000, Jamie Lokier wrote:
> > Personally, I suspect you're right, and it should be converted too.
>
> It would need to optional for git users whose source code isn't UTF-8 -
> possibly converting the other way for them. But yeah I think it'd make
> sense to be on by default.
It's silly to talk of 'converting the other way'. The tool converts from
the charset of the email, to the charset that the git repository is
configured for (which is UTF-8 by default and in all sane cases).
Why would you want to convert the other way?
It is currently optional, but I suspect that's the wrong approach. The
only reason you'd ever want that is if the mail it's interpreting is
mislabelled -- and in that case surely the best workaround is an option
which lets you override the Content-Type: header, not just disable the
charset conversion altogether. Obviously if you override the input
charset to be equal to your repository configuration, that means that no
conversion is done. But that's just a fairly unimportant special case of
the override, surely?
> Section 4.1.2, Charset Parameter, final paragraph:
>
> >> In general, composition software should always use the "lowest common
> >> denominator" character set possible. For example, if a body contains
> >> only US-ASCII characters, it SHOULD be marked as being in the US-
> >> ASCII character set, not ISO-8859-1, which, like all the ISO-8859
> >> family of character sets, is a superset of US-ASCII. More generally,
> >> if a widely-used character set is a subset of another character set,
> >> and a body contains only characters in the widely-used subset, it
> >> should be labelled as being in that subset. This will increase the
> >> chances that the recipient will be able to view the resulting entity
> >> correctly.
>
> It's a SHOULD, but it's still a good idea. ISO-8859-1 is still very
> widely-used for email.
That might have made sense 13 years ago, but today I would suggest that
we ought to be applying a general principle that UTF-8 SHOULD be used
everywhere.
By doing so, we mostly sidestep the need for the whole charset labelling
and conversion clusterfuck that nobody ever quite managed to get right
anyway. If everything on the system is UTF-8, all the time, then it
doesn't _matter_ that nobody ever really managed to get labelling to
work.
>From RFC2119:
3. SHOULD This word, or the adjective "RECOMMENDED", mean that there
may exist valid reasons in particular circumstances to ignore a
particular item, but the full implications must be understood and
carefully weighed before choosing a different course.
I understand the full implications of not using legacy character sets,
and have carefully weighed them before choosing to send UTF-8.
What's the down-side? That some Luddite out there might have a system
which _still_ can't render UTF-8 email in 2010, so they're only seeing
the 99.999% of the email which is readable as ASCII?
--
dwmw2
next prev parent reply other threads:[~2010-01-06 23:43 UTC|newest]
Thread overview: 18+ messages / expand[flat|nested] mbox.gz Atom feed top
2010-01-05 21:59 [PATCH 2/2] Remove now-defunct ts7250 nand driver H Hartley Sweeten
2010-01-06 13:31 ` David Woodhouse
2010-01-06 17:26 ` H Hartley Sweeten
2010-01-06 17:42 ` David Woodhouse
2010-01-06 17:47 ` H Hartley Sweeten
2010-05-06 16:47 ` H Hartley Sweeten
2010-05-07 6:01 ` Artem Bityutskiy
2010-05-07 16:37 ` H Hartley Sweeten
2010-01-06 18:07 ` Sending UTF-8 patches (was: [PATCH 2/2] Remove now-defunct ts7250 nand driver) Jamie Lokier
2010-01-06 18:36 ` David Woodhouse
2010-01-06 19:01 ` Nicolas Pitre
2010-01-06 23:21 ` Jamie Lokier
2010-01-06 23:43 ` David Woodhouse [this message]
2010-01-06 18:55 ` Nicolas Pitre
2010-01-06 23:05 ` Jamie Lokier
2010-01-06 23:08 ` David Woodhouse
2010-01-06 23:27 ` Jamie Lokier
2010-01-06 23:50 ` David Woodhouse
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=1262821380.3181.8838.camel@macbook.infradead.org \
--to=dwmw2@infradead.org \
--cc=linux-arm-kernel@lists.infradead.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).