public inbox for linux-kernel@vger.kernel.org
 help / color / mirror / Atom feed
From: kernel@mikebell.org
To: Dave Kleikamp <shaggy@austin.ibm.com>
Cc: linux-kernel <linux-kernel@vger.kernel.org>
Subject: Re: JFS default behavior / UTF-8 filenames
Date: Thu, 19 Feb 2004 15:47:47 -0800	[thread overview]
Message-ID: <20040219234746.GG432@tinyvaio.nome.ca> (raw)
In-Reply-To: <1077199506.2275.12.camel@shaggy.austin.ibm.com>

On Thu, Feb 19, 2004 at 08:05:06AM -0600, Dave Kleikamp wrote:
> The arbitrary string of bytes is treated as the latin1 charset in that
> it is stored as 0x00nn (in UTF2), but JFS really doesn't care what the
> character set is.

While I don't really care one way or the other about the whole
"rejecting non-UTF8 filenames" thing, trying to store 8bit strings in
UTF2 (no such thing, is there? Is JFS UCS-2 or UTF-16?) seems really
ugly. In general at least, maybe it's not so bad in JFS's case
specifically because of there not being much sharing of JFS filesystems
between linux and non-linux systems.

But if JFS uses that "make the high byte zero and return the low byte
only" scheme, what does it do when it encounters a UCS-2 filename that
has a non-NUL high byte on an existing filesystem? I can't see any ways
of dealing with this that aren't much more horribly broken than merely
refusing to create filenames that aren't valid in the current encoding.
If it throws the high byte away then you've made it impossible to open
said files, and up to 256 files per character of the filename can now
appear to have the same filename.

So what does JFS do in its "throw away the high byte and store binary
character strings in the low byte" mode? How does it deal with an
existing filesystem that has filenames that don't conform to said rule?

  reply	other threads:[~2004-02-19 23:47 UTC|newest]

Thread overview: 18+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2004-02-15 23:03 JFS default behavior Nicolas Mailhot
2004-02-16  3:45 ` Jan Knutar
2004-02-16  8:30   ` Nicolas Mailhot
2004-02-16  8:54     ` Valdis.Kletnieks
2004-02-16  6:21 ` jw schultz
2004-02-16 15:55   ` Jamie Lokier
2004-02-17  6:47     ` jw schultz
2004-02-17 21:37       ` Jamie Lokier
2004-02-17 22:12         ` Linus Torvalds
2004-02-18  9:59           ` Jamie Lokier
2004-02-18 15:54             ` Linus Torvalds
2004-02-18 23:58               ` Jamie Lokier
2004-02-19 10:59 ` JFS default behavior / UTF-8 filenames kernel
2004-02-19 14:05   ` Dave Kleikamp
2004-02-19 23:47     ` kernel [this message]
2004-02-20 15:00       ` Dave Kleikamp
2004-02-22 19:22         ` kernel
2004-02-24 14:44           ` Dave Kleikamp

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20040219234746.GG432@tinyvaio.nome.ca \
    --to=kernel@mikebell.org \
    --cc=linux-kernel@vger.kernel.org \
    --cc=shaggy@austin.ibm.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox