[fpc-pascal] Unicode file routines proposal
Marco van de Voort
marcov at stack.nl
Tue Jul 1 09:26:32 CEST 2008
> > They have a UTF-16/UCS-2 internal representation, same as MSEgui which works
> > very well and is fast and handy BTW.
>
> And len, slicing, etc. work as expected.
> Note that if you need characters beyond $ffff you have to compile it
> with wide unicode support, and in that case every character will use 4
> bytes.
>
That's IMHO a faulty system. It requires you to choose between an incomplete
solution or making strings a horrible memory hog. But maybe that doesn't
matter for mere scripting languages (though I wonder then why they didn't
chose UTF-32 directly)
Surrogates are not nice, but they were invented for a reason.
More information about the fpc-pascal
mailing list