From mboxrd@z Thu Jan 1 00:00:00 1970 From: Dave Chinner Subject: Re: [PATCH 06/10] xfs: add unicode character database files Date: Tue, 23 Sep 2014 06:54:38 +1000 Message-ID: <20140922205438.GM4267@dastard> References: <20140918195650.GI19952@sgi.com> <20140918201440.GI4482@sgi.com> Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii Cc: linux-fsdevel@vger.kernel.org, tinguely@sgi.com, olaf@sgi.com, xfs@oss.sgi.com To: Ben Myers Return-path: Received: from ipmail06.adl6.internode.on.net ([150.101.137.145]:46311 "EHLO ipmail06.adl6.internode.on.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1753128AbaIVUym (ORCPT ); Mon, 22 Sep 2014 16:54:42 -0400 Content-Disposition: inline In-Reply-To: <20140918201440.GI4482@sgi.com> Sender: linux-fsdevel-owner@vger.kernel.org List-ID: On Thu, Sep 18, 2014 at 03:14:40PM -0500, Ben Myers wrote: > From: Olaf Weber > > Add files from the Unicode Character Database, version 7.0.0, to the source. > A helper program that generates a trie used for normalization from these > files is part of a separate commit. > > Signed-off-by: Olaf Weber > --- > [v2: Removed large unicode files prior to posting. Get them as below. -bpm] > [v3: Moved files to ucd8norm directory. -bpm] > > cd fs/xfs/utf8norm/ucd > wget http://www.unicode.org/Public/7.0.0/ucd/CaseFolding.txt > wget http://www.unicode.org/Public/7.0.0/ucd/DerivedAge.txt > wget http://www.unicode.org/Public/7.0.0/ucd/extracted/DerivedCombiningClass.txt > wget http://www.unicode.org/Public/7.0.0/ucd/DerivedCoreProperties.txt > wget http://www.unicode.org/Public/7.0.0/ucd/NormalizationCorrections.txt > wget http://www.unicode.org/Public/7.0.0/ucd/NormalizationTest.txt > wget http://www.unicode.org/Public/7.0.0/ucd/UnicodeData.txt > for e in *.txt > do > base=`basename $e .txt` > mv $e $base-7.0.0.txt > done > --- > fs/xfs/utf8norm/ucd/README | 33 +++++++++++++++++++++++++++++++++ This probably needs to live somewhere under lib/. There's nothing XFS specific in it and the translations should be the same for anything that wants to parse unicode. Cheers, Dave. -- Dave Chinner david@fromorbit.com