[fpc-devel] Re: enumerators

Marco van de Voort marcov at stack.nl
Thu Nov 18 14:31:03 CET 2010


In our previous episode, Michael Schnell said:
> > found by a dumb byte/char scan; only a few encodings have to be 
> > recognized and handled, based on the char size: MBCS (UTF-8...), 
> > WideChars (UTF-16/UCS2) and UTF-32.
> >
> In fact I suppose that for UTF-8 ("pure UTF-8" without surrogates) pos() 
> works for all strings and a UTF-8 "character" is a string.

Either you have UTF-8 with surrogates (i.e. multi-byte sequences), or you
have ASCII, since UTF-8 without surrogates means that only chars 0..127 are
valid, which is ASCII.
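
To make the point concrete, here is a minimal sketch in Python rather than Pascal. It assumes a byte-wise search as a stand-in for pos() on a UTF-8 string; the names byte_pos and is_single_byte_only are made up for this illustration and are not part of any RTL. It shows that a byte-wise search does find multi-byte characters, that the result is a byte offset rather than a character index, and that text restricted to single-byte sequences is exactly ASCII.

```python
def byte_pos(needle: str, haystack: str) -> int:
    """Byte-wise search over UTF-8 data.

    Returns a 1-based byte offset, or 0 if the needle is absent
    (the convention Pascal's pos() uses). Hypothetical helper.
    """
    return haystack.encode("utf-8").find(needle.encode("utf-8")) + 1


def is_single_byte_only(text: str) -> bool:
    """True if every character encodes to a single UTF-8 byte.

    Single-byte UTF-8 sequences cover only code points 0..127,
    so this is the same as asking whether the text is ASCII.
    """
    return all(b < 0x80 for b in text.encode("utf-8"))


# "naive cafe" with two accented letters, each 2 bytes in UTF-8.
text = "na\u00efve caf\u00e9"

# The word starts at character 7 but at byte 8, because the
# accented i earlier in the string occupies two bytes.
print(byte_pos("caf\u00e9", text))

# A single multi-byte "character" is found like any other substring.
print(byte_pos("\u00e9", text))

print(is_single_byte_only(text), is_single_byte_only("plain"))
```

The search cannot match in the middle of a character as long as both strings are valid UTF-8, because lead bytes and continuation bytes use disjoint value ranges. What the caller gets back is still a byte offset, so anything that needs a character index has to convert it.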



