From your writing I understood that the issue is a UTF8 -> 21-bit-unicode decoding issue and has nothing to do with ISO/ANSI (which would render the problem thoroughly unsolvable, not only for the compiler builder but also for the application programmer, who wants to do a unicode aware program. -Michael