[fpc-devel] utf8 reading
uberto at ubiland.net
Thu Mar 10 19:29:40 CET 2005
> UCS-2 or UTF-16 how it called by the unicode consortium is "escaped" as
> well and you've to take care of it in your code.
UCS-2 is different from utf-16 (which is escaped), but you cannot represent
all utf characters (see the case of Vogon poetry).
> Even in utf-32 you've
> to take care of surrogate pairs.
In utf-32 yes, in UCS-4 no.
Teorically we could have a UCS-8 in the future, but for the next hundred years
this is not very likely.
> > Using natively utf-8 I think is impossible, because the encoding.
Because every simple function on strings (like copy) should require to start
reading the string from beginning
More information about the fpc-devel