[fpc-devel] Unicode Letters

ik idokan at gmail.com
Thu Jul 17 07:50:06 CEST 2008


In hebrew (at least) the punctuation is a different char that comes
after the letter, but painted like it was part of the letter, so you
can parse each word and ignore non letter value (it arrives in
different range in the unicode table).

Ido

On Wed, Jul 16, 2008 at 10:09 PM, theo <xpde at theo.ch> wrote:
> Is there a way to separate unicode letters from punctuation and the like?
> The reason is simple: I would like to separate words in a text for a
> spell-checker.
> I see there are tables which list unicode categories
> http://www.sql-und-xml.de/unicode-database/#kategorien
> Is there already something for freepascal to get such information?
> Is there a better way to do what I need?
>
> Thanks
> Theo
>
>
> _______________________________________________
> fpc-devel maillist  -  fpc-devel at lists.freepascal.org
> http://lists.freepascal.org/mailman/listinfo/fpc-devel
>



-- 
http://ik.homelinux.org/



More information about the fpc-devel mailing list