Re: unicode (char as abstract data type)

Dmitry Yaitskov (dima@interlog.com)
Sat, 18 Apr 1998 10:12:18 -0400


On Saturday, Apr 18, Alex Belits (abelits@phobos.illtel.denver.co.us) spake thusly:
> On Sat, 18 Apr 1998, Albert D. Cahalan wrote:
>
> > > To convert what? We have multiple encodings because we have multiple
> > > languages, and conversion through Unicode is useful only within the
> > > language because otherwise there will be nothing to map into. koi8-r and
> > > iso8859-1 charsets have no common characters except the 7-bit ASCII range.
> >
> > You have multiple Russian encodings.
>
> I certainly don't. If anything for any reason can arrive at my host in
> Russian, in encoding other than koi-8, it gets converted once (and yes,
> UTF-8 is used as intermediate format). The same happens with all properly
> installed Unix boxes with cyrillic support.

Could you please explain how's it done? (it's a genuine question, not
an argument)... (you probably could answer me via email as this has
nothing to do with kernel). Thanx.

<snip>
> > People share both disk and network filesystems with other OSs.
>
> ...and in that case they have to keep the same charset on all systems
> that are supposed to read files -- for simple reason of being capable of
> understanding the data in their files.

Well, that's not exactly true - an obvious example are a lot of web
pages either having multiple pages for different encodings, or
on-the-fly translators...

-- 
Cheers,
 -Dima.

- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.rutgers.edu