[fpc-devel] Unicode functions
Daniël Mantione
daniel.mantione at freepascal.org
Tue Aug 26 11:59:16 CEST 2008
On Tue, 26 Aug 2008, Graeme Geldenhuys wrote:
> On 8/26/08, Daniël Mantione <daniel.mantione at freepascal.org> wrote:
>>> defined as a word (2 bytes), so that means it's only UCS2 compliant
>>> and not full Unicode UTF-16 (which is what we want).
>>
>> For uppercasing/lowercasing it is correct to define a Unicode char as 2
>> bytes.
>
> Let's not limit ourselves again! Let's plan ahead this time. After all,
> I'm busy learning Klingon [wink], which falls outside the BMP range.
> UCS2 or word size is limited to the BMP only.
Klingon does not have capitals, nor does any script outside the BMP. This
is intentional, to keep people's code simple. So there is no "limiting
ourselves".
And by the way, Klingon is not part of Unicode; unofficially it is assigned
#f8d0 to #f8ff (in the Private Use Area), which is inside the BMP.
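To make the range argument concrete, here is a minimal sketch of the check. It is written in Python purely for illustration, and the names (in_bmp, utf16_units, klingon_fits_in_word) are hypothetical, not anything from the FPC RTL. It assumes only the standard UTF-16 rule: code points up to #ffff take one 16-bit unit, anything above takes a surrogate pair.

```python
# Illustrative sketch only; names are hypothetical, not part of FPC's RTL.
BMP_LAST = 0xFFFF  # last code point of the Basic Multilingual Plane


def in_bmp(code_point: int) -> bool:
    """True if the code point lies in the BMP (U+0000..U+FFFF)."""
    return 0 <= code_point <= BMP_LAST


def utf16_units(code_point: int) -> int:
    """Number of 16-bit UTF-16 code units needed for a code point."""
    return 1 if in_bmp(code_point) else 2  # surrogate pair outside the BMP


# The unofficial Klingon range mentioned above.
KLINGON_FIRST, KLINGON_LAST = 0xF8D0, 0xF8FF
klingon_fits_in_word = all(
    utf16_units(cp) == 1 for cp in range(KLINGON_FIRST, KLINGON_LAST + 1)
)
```

Under these assumptions klingon_fits_in_word comes out true, so every character in that range fits in a single word, whereas something like #1f600 would need two.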
Daniël