Re: [2.6 patch] UTF-8 fixes in comments

From: H. Peter Anvin
Date: Tue Apr 29 2008 - 15:34:25 EST


Alan Cox wrote:

Funny that you mention Windows. Windows has been using 16-bit unicode
for a long time without problems. It's a clean encoding. Like it or not.

I would describe the UCS-2 situation as a disaster area - embedded nuls
causing breakage, inability to represent the full unicode space and
awkward programming interfaces.


Not to mention the fact that UCS-2 ran out of code points almost as soon as they said "no more codepoints." The result was UTF-16, a hideous abortion which took all the problems with wide encodings, combined it with all the problems of multibyte encodings, and added a few new ones for good measure.

-hpa
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/