> a) "ü": "LATIN SMALL LETTER U WITH DIAERESIS", encoded as $C3 $BC
> b) "ü": "LATIN SMALL LETTER U", encoded as $75, followed by "COMBINING
> DIAERESIS", which is encoded as $CC $88

I see, but I fail to see the sense of providing two different UTF-8 code variants for the same Unicode character.

-Michael
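
For reference, the two byte sequences quoted above can be reproduced with a minimal Python sketch. It is only an illustration, not part of the original exchange: the variable names are made up, and it assumes Python 3.8 or later for `bytes.hex(" ")`. It also shows that the two spellings are different code point sequences which Unicode normalization (NFC/NFD) maps onto each other.

```python
import unicodedata

# Hypothetical names, chosen for this illustration only.
precomposed = "\u00fc"   # a) LATIN SMALL LETTER U WITH DIAERESIS
decomposed = "u\u0308"   # b) LATIN SMALL LETTER U + COMBINING DIAERESIS

# UTF-8 bytes of each spelling, as quoted in the message.
print(precomposed.encode("utf-8").hex(" "))  # c3 bc
print(decomposed.encode("utf-8").hex(" "))   # 75 cc 88

# The strings differ code point by code point...
print(precomposed == decomposed)             # False

# ...but normalization converts one form into the other.
print(unicodedata.normalize("NFC", decomposed) == precomposed)  # True
print(unicodedata.normalize("NFD", precomposed) == decomposed)  # True
```

Comparing strings only after normalizing both sides to the same form is one common way to treat the two spellings as equal.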