[fpc-devel] Unicode conversion routines
    Graeme Geldenhuys 
    graemeg.lists at gmail.com
       
    Sat Nov 22 19:55:22 CET 2008
    
    
  
On Sat, Nov 22, 2008 at 10:51 AM, Florian Klaempfl
<florian at freepascal.org> wrote:
> function UTF8Decode(const s : UTF8String): UnicodeString;
Is there some hard-coded limit on UTF8Decode?  I am writing some unit
tests for these methods and on my 3rd test, it already fails. I'm
using testing sample data from unicode.org.
var
  s8: UTF8String;
  s16: UnicodeString;
begin
  // U+289A8 CJK UNIFIED IDEOGRAPH-289A8
  s8 := Char($F0) + Char($A8) + Char($A6) + Char($A8);
  s16 := UTF8Decode(s8);
  AssertEquals('Failed on 4', 4, Length(s16));
  AssertEquals('Failed on 5', UnicodeChar($D862) + UnicodeChar($DDA8), s16)
end;
Test 4 fails: Expected 4 but was 0.
Test 5 fails: to due to Test 4 failure...
---------------[  from unicode description file ]-----------------------
U+289A8 CJK UNIFIED IDEOGRAPH-289A8
General Character Properties
Unicode category: Letter, Other
Various Useful Representations
UTF-8: 0xF0 0xA8 0xA6 0xA8
UTF-16: 0xD862 0xDDA8
------------------------[ end ]----------------------------
Regards,
  - Graeme -
_______________________________________________
fpGUI - a cross-platform Free Pascal GUI toolkit
http://opensoft.homeip.net/fpgui/
    
    
More information about the fpc-devel
mailing list