[fpc-devel] Unicode functions

Daniël Mantione daniel.mantione at freepascal.org
Tue Aug 26 11:59:16 CEST 2008



Op Tue, 26 Aug 2008, schreef Graeme Geldenhuys:

> On 8/26/08, Daniël Mantione <daniel.mantione at freepascal.org> wrote:
>>> defined as a word (2 bytes), so that means it's only UCS2 compliant
>>> and not full Unicode UTF-16 (which is what we want).
>>
>>  For uppercasing/lowercasing it is correct to define a Unicode char as 2
>> bytes.
>
> Lets not limit ourselves again! Lets plan ahead this time.  After all,
> I'm busy learning Klingon [wink], which falls outside the BMP range.
> UCS2 or word size is limited to the BMP only.

Klingon does not have capitals, nor does any script outside the BMP. And 
this is intentionally to keep people their code simple. So there is no 
"limiting ourselves".

And by the way, Klingon's is not part of Unicode, and unofficially #f8d0 
to #f8ff which is inside the BMP.

Daniël


More information about the fpc-devel mailing list