[fpc-pascal] UTF-8 versions of Copy() and Length()

Felipe Monteiro de Carvalho felipemonteiro.carvalho at gmail.com
Sat May 19 11:57:44 CEST 2007


On 5/19/07, Rimgaudas Laucius <rimga at ktl.mii.lt> wrote:
> It is not useful to have functions for both encodings, because these
> encodings are interconvertable and it is more effective to use UTF-16 for
> data processing

I disagree. The conversion impacts performance heavely. It will also
require memory to store the converted string, and after you perform a
operation you need to convert back.

Further, UTF-16 contains both 2-byte characters and 4-byte characters,
so I don't see how it would be any faster to process it in comparison
to process a utf-8 string.

About being easier to implement, that's irrelevant, because the
functions are already done.

-- 
Felipe Monteiro de Carvalho



More information about the fpc-pascal mailing list