Re: [PATCH] UTF-8ifying the kernel source

From: H. Peter Anvin
Date: Mon Mar 08 2004 - 19:32:20 EST


Followup to: <1078571331.963.3.camel@xxxxxxxxxxxxxxxx>
By author: Xavier Bestel <xavier.bestel@xxxxxxx>
In newsgroup: linux.dev.kernel
>
> Le sam 06/03/2004 à 00:33, H. Peter Anvin a écrit :
> > Followup to: <20040305232425.GA6239@xxxxxxxxxxxxx>
> > By author: David Eger <eger@xxxxxxxxxxxxx>
> > In newsgroup: linux.dev.kernel
> >
> > > The third patch concerns 8-bit characters embedded in C strings.
> > > These are almost always output to devfs or proc. The characters used are
> > > the degrees symbol (for ppc temp. sensors) and mu (for micro-seconds).
> >
> > I would highly vote for making those UTF-8 unless it breaks protocol.
>
> ISO-8859-1 characters are mostly the same in UTF-8.
>

Unicode, yes. UTF-8, no. The ISO-8859-1 character "Å" (0xC5) does,
indeed correspond to Unicode character U+00C5, but it's encoded 0xC3
0x85 in UTF-8.

-hpa
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/