Re: unicode

Guest section DW (dwguest@win.tue.nl)
Sat, 16 May 1998 13:04:00 +0200 (MET DST)


tytso@mit.edu writes:

> However, by default, it is fair game for any future kernel extension for
> handling internationalization (probably using a UCS-2 or UCS-4
> interface) can assume that the ext2 file format is encoded using UTF-8.

Hmm, a difficult to read sentence.

Just for your entertainment, have you read
POSIX (ISO/IEC 9945-1: 1996) B.2.3.4 (5)?
(Don't be afraid, it is not a prescription, it is just a
discussion about common usage, where it is remarked that
many Unix systems use filenames in several character sets,
sometimes even a single filename uses several character sets.
Clearly, instead of planning a layer on top of ext2,
you plan to change ext2 into ext2a in such a way that
such Unix systems no longer can use ext2a. If these are
Linux systems, they'll use ext2, and Linux development will split.)

[Americans tend to underestimate the enormous cost in time
and money of a conversion. Every American would consider
a proposal to convert all filenames to EBCDIC ridiculous,
just impossible, but now that ASCII and UTF-8 happen to
coincide and Americans can convert for free, they talk
easily about the horrors they plan to inflict on the rest
of the world. Fortunately, for the time being, these plans
look like empty words.]

-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@vger.rutgers.edu