[fpc-devel] utf8 reading
Uberto Barbini
uberto at ubiland.net
Thu Mar 10 19:29:40 CET 2005
> UCS-2 or UTF-16 how it called by the unicode consortium is "escaped" as
> well and you've to take care of it in your code.
mmh, no.
UCS-2 is different from UTF-16 (which is escaped): UCS-2 is fixed-width, but
with it you cannot represent all Unicode characters (see the case of Vogon
poetry).
See:
http://www.uazone.com/multiling/unicode/ucs2.html
http://lists.samba.org/archive/jcifs/2002-July/000969.html
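To make the "escaping" concrete, here is a small sketch of how UTF-16 splits a code point above U+FFFF into a surrogate pair, which is exactly what plain UCS-2 has no room for. The function name utf16_units is only mine, for illustration; the arithmetic is the standard UTF-16 one.

```python
def utf16_units(cp):
    """Return the 16-bit code units that encode code point cp in UTF-16."""
    if cp < 0 or cp > 0x10FFFF or 0xD800 <= cp <= 0xDFFF:
        raise ValueError("not a Unicode scalar value")
    if cp < 0x10000:
        # Fits in one unit: this is the only range UCS-2 can express.
        return [cp]
    # Above U+FFFF: subtract 0x10000, then split the remaining 20 bits
    # into a high (lead) and a low (trail) surrogate.
    v = cp - 0x10000
    return [0xD800 | (v >> 10), 0xDC00 | (v & 0x3FF)]


if __name__ == "__main__":
    for cp in (0x41, 0x20AC, 0x1D11E):
        print(hex(cp), [hex(u) for u in utf16_units(cp)])
```

So code that indexes a UTF-16 string by 16-bit unit has to watch for values in D800..DFFF, while a UCS-2 string simply cannot contain such characters.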
> Even in utf-32 you've
> to take care of surrogate pairs.
In utf-32 yes, in UCS-4 no.
Theoretically we could have a UCS-8 in the future, but for the next hundred years
this is not very likely.
> > Using natively utf-8 I think is impossible, because the encoding.
>
> Why?
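Because every simple function on strings (like copy) would have to start
reading the string from the beginning: characters take a variable number of
bytes, so the byte offset of the n-th character cannot be computed directly.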
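A rough sketch of what I mean, written in Python only to keep it short. It is not how any RTL does it: utf8_copy is a made-up name, it takes a 0-based character index, and it assumes the input is valid UTF-8.

```python
def utf8_copy(data, start, count):
    """Copy `count` characters starting at character index `start` (0-based)
    from UTF-8 encoded bytes. Assumes `data` is valid UTF-8."""

    def advance(pos, n):
        # Step over n characters starting at byte offset pos.
        while n > 0 and pos < len(data):
            pos += 1  # lead byte of the current character
            # Skip its continuation bytes (bit pattern 10xxxxxx).
            while pos < len(data) and (data[pos] & 0xC0) == 0x80:
                pos += 1
            n -= 1
        return pos

    # The scan must begin at byte 0: the offset of character `start`
    # depends on the width of every character before it.
    begin = advance(0, start)
    end = advance(begin, count)
    return data[begin:end]


if __name__ == "__main__":
    s = "a\u00e9\u20ac\U0001D11Ez".encode("utf-8")  # widths 1, 2, 3, 4, 1 bytes
    print(utf8_copy(s, 2, 2))
```

With a fixed-width encoding the same copy is just a slice at start * width, with no loop at all.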
Bye Uberto