On 2016-04-30 11:32, Martin Schreiber wrote: > One could say that utf-8 has surrogate pairs, surrogate triplets and surrogate > quads. No, don't confuse the point. As per the Unicode Standards definition of "surrogate pairs", UTF-8 and UTF-32 don't have surrogate pairs. Regards, Graeme