[fpc-devel] String and UnicodeString and UTF8String

Michael Schnell mschnell at lumino.de
Wed Jan 12 10:24:04 CET 2011


On 01/11/2011 05:19 PM, Hans-Peter Diettrich wrote:
>
> IMO a single encoding, i.e. UTF-8, can cover all cases.

Of course you are right here, but there are some things to be considered:

In Windows (and maybe elsewhere, too) a two-Byte API (e.g. UTF-16) needs 
to be used, forcing lots of conversions when doing GUI applications.

_All_ beginners will use s[i] and expect to get a character without any 
afterthought. They will be very disappointed when not using English if 
they get bytes instead of characters. The count of the frustrated will 
be much smaller (but >Zero) when doing Widestring/Widechar and they get 
Words instead of Characters.

Eliminating the s[i] syntax would trash  a lot of legacy code and the 
decent replacement (finding the correct character and moving it into a 
DWord in UCS4) is slow and still does not handle all the funny Unicode 
character-combining stuff. But the count of frustrated beginners might 
be further reduced.

-Michael



More information about the fpc-devel mailing list