[fpc-pascal] Unicode file routines proposal

Marco van de Voort marcov at stack.nl
Tue Jul 1 09:26:32 CEST 2008


> > They have a UTF-16/UCS-2 internal representation, same as MSEgui which works 
> > very well and is fast and handy BTW.
> 
> And len, slicing, etc. work as expected.
> Note that if you need characters beyond $ffff you have to compile it
> with wide unicode support, and in that case every character will use 4
> bytes.
> 
That's IMHO a faulty system. It requires you to choose between an incomplete
solution or making strings a horrible memory hog. But maybe that doesn't
matter for mere scripting languages (though I wonder then why they didn't
chose UTF-32 directly)

Surrogates are not nice, but they were invented for a reason.




More information about the fpc-pascal mailing list