From: Gabriel Krisman Bertazi <krisman@collabora.co.uk>
To: Theodore Ts'o <tytso@mit.edu>
Cc: david@fromorbit.com, bpm@sgi.com, olaf@sgi.com,
linux-ext4@vger.kernel.org, linux-fsdevel@vger.kernel.org,
kernel@lists.collabora.co.uk, alvaro.soliverez@collabora.co.uk
Subject: Re: [PATCH RFC 03/13] charsets: utf8: Add unicode character database files
Date: Sat, 13 Jan 2018 02:28:05 -0200 [thread overview]
Message-ID: <87wp0mqt62.fsf@collabora.co.uk> (raw)
In-Reply-To: <20180113002407.GD8249@thunk.org> (Theodore Ts'o's message of "Fri, 12 Jan 2018 19:24:07 -0500")
Theodore Ts'o <tytso@mit.edu> writes:
> On Fri, Jan 12, 2018 at 05:12:24AM -0200, Gabriel Krisman Bertazi wrote:
>> From: Olaf Weber <olaf@sgi.com>
>>
>> Add files from the Unicode Character Database, version 7.0.0, to the source.
>> A helper program that generates a trie used for normalization from these
>> files is part of a separate commit.
>
> It looks like the latest version of Unicode is 10.0.0. Once we pick a
> Unicode version, changing will be painful; but in the absence of
> interop requirements, is there a reason to stick with Unicode 7? Why
> not take the latest version of Unicode and then freeze on it?
>
Hi Ted,
No, there isn't a specific reason for unicode 7 and I forgot to mention
this in my cover letter. I have successfully generated the data file
for 10.0.0 with the mkutf8data script, but I couldn't validate it
entirely yet. I walked through changelogs to make sure any relevant
changes where there, but I'm not done yet. You can definitely expect
new versions of the patchset to support 10.0.0.
Thanks,
--
Gabriel Krisman Bertazi
next prev parent reply other threads:[~2018-01-13 4:28 UTC|newest]
Thread overview: 24+ messages / expand[flat|nested] mbox.gz Atom feed top
2018-01-12 7:12 [PATCH RFC 00/13] UTF-8 case insensitive lookups for EXT4 Gabriel Krisman Bertazi
2018-01-12 7:12 ` [PATCH RFC 01/13] charsets: Introduce middle-layer for character encoding Gabriel Krisman Bertazi
2018-01-12 7:12 ` [PATCH RFC 02/13] charsets: ascii: Wrap ascii functions to charsets library Gabriel Krisman Bertazi
2018-01-12 7:12 ` [PATCH RFC 03/13] charsets: utf8: Add unicode character database files Gabriel Krisman Bertazi
2018-01-12 16:59 ` Darrick J. Wong
2018-01-12 20:29 ` Weber, Olaf (HPC Data Management & Storage)
2018-01-13 0:24 ` Theodore Ts'o
2018-01-13 4:28 ` Gabriel Krisman Bertazi [this message]
2018-01-12 7:12 ` [PATCH RFC 04/13] scripts: add trie generator for UTF-8 Gabriel Krisman Bertazi
2018-01-12 7:12 ` [PATCH RFC 05/13] charsets: utf8: Introduce code for UTF-8 normalization Gabriel Krisman Bertazi
2018-01-12 7:12 ` [PATCH RFC 06/13] charsets: utf8: reduce the size of utf8data[] Gabriel Krisman Bertazi
2018-01-12 7:12 ` [PATCH RFC 07/13] charsets: utf8: Hook-up utf-8 code to charsets library Gabriel Krisman Bertazi
2018-01-12 10:38 ` Weber, Olaf (HPC Data Management & Storage)
2018-01-16 16:50 ` Gabriel Krisman Bertazi
2018-01-16 22:19 ` Weber, Olaf (HPC Data Management & Storage)
2018-01-23 3:33 ` Gabriel Krisman Bertazi
2018-01-12 7:12 ` [PATCH RFC 08/13] charsets: utf8: Introduce test module for kernel UTF-8 implementation Gabriel Krisman Bertazi
2018-01-12 7:12 ` [PATCH RFC 09/13] ext4: Add ignorecase mount option Gabriel Krisman Bertazi
2018-01-12 7:12 ` [PATCH RFC 10/13] ext4: Include encoding information on the superblock Gabriel Krisman Bertazi
2018-01-12 7:12 ` [PATCH RFC 11/13] fscrypt: Introduce charset-based matching functions Gabriel Krisman Bertazi
2018-01-12 7:12 ` [PATCH RFC 12/13] ext4: Support charset name matching Gabriel Krisman Bertazi
2018-01-12 7:12 ` [PATCH RFC 13/13] ext4: Implement ext4 dcache hooks for custom charsets Gabriel Krisman Bertazi
2018-01-12 10:52 ` Weber, Olaf (HPC Data Management & Storage)
2018-01-12 16:56 ` [PATCH RFC 00/13] UTF-8 case insensitive lookups for EXT4 Jeremy Allison
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=87wp0mqt62.fsf@collabora.co.uk \
--to=krisman@collabora.co.uk \
--cc=alvaro.soliverez@collabora.co.uk \
--cc=bpm@sgi.com \
--cc=david@fromorbit.com \
--cc=kernel@lists.collabora.co.uk \
--cc=linux-ext4@vger.kernel.org \
--cc=linux-fsdevel@vger.kernel.org \
--cc=olaf@sgi.com \
--cc=tytso@mit.edu \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.