[fpc-devel] fpdoc and unicode characters

Michael Van Canneyt michael at freepascal.org
Sat Aug 23 14:01:30 CEST 2008


> 
> 
> On Thu, 14 Aug 2008, Graeme Geldenhuys wrote:
> 
> > Hi,
> 
> In researching how to type Unicode characters on different platforms,
> I came across an interesting argument regarding Unicode characters and
> HTML.  The argument might apply to fpdoc documentation (xml) files as
> well—hence the reason for this post.
> 
> With W3C embracing UTF-8 as the de facto standard for HTML pages, do
> we still need to escape characters like the right single quotation
> mark [’ U+2019] to [&rsquo;], etc.? Unicode has been around for some
> time now, so surely all half-decent software should be able to read
> and display the actual character correctly by now (a sensitive
> subject for FPC and Delphi at the moment), instead of having to
> bother with the escaped version.
> 
> How does this argument fit with XML, which also uses UTF-8 as the de
> facto standard encoding? And seeing that fpdoc uses XML for the
> documentation files, can I use the actual Unicode characters in my
> fpdoc documentation, or must I still stick with the—what now seems to
> be outdated—escaped method?
> 
> BTW:
> These are the characters I was interested in.
> — (U+2014): emphasis dash
> … (U+2026): horizontal ellipsis
> ’ (U+2019): right single quotation mark
> “ (U+201C): left double quotation mark
> ” (U+201D): right double quotation mark
> ― (U+2015): quotation dash (introducing quoted text)
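
As a rough illustration of the XML side of the question, here is a minimal
sketch using Python's standard xml.etree.ElementTree parser rather than
fpdoc itself. It only shows how a conforming XML parser treats the six
characters listed above in a UTF-8 document; it says nothing about how
fpdoc's own reader or output writers behave. The <descr> wrapper element
and the helper names parse_text and parses are assumptions made for the
example.

```python
import xml.etree.ElementTree as ET

# Code points of the six characters listed in the message above.
CODE_POINTS = [0x2014, 0x2026, 0x2019, 0x201C, 0x201D, 0x2015]


def parse_text(body: str) -> str:
    """Wrap body in a UTF-8 XML document and return the parsed text."""
    # <descr> is only an illustrative element name for this sketch.
    doc = f'<?xml version="1.0" encoding="UTF-8"?><descr>{body}</descr>'
    return ET.fromstring(doc.encode("utf-8")).text


def parses(body: str) -> bool:
    """Report whether body is accepted as XML element content."""
    try:
        parse_text(body)
        return True
    except ET.ParseError:
        return False


# The same six characters, once typed directly and once written as
# numeric character references (&#x2014; and so on).
literal = "".join(chr(cp) for cp in CODE_POINTS)
escaped = "".join(f"&#x{cp:04X};" for cp in CODE_POINTS)

same_text = parse_text(literal) == parse_text(escaped)
bare_ampersand_ok = parses("a & b")    # markup characters still need escaping
html_entity_ok = parses("&rsquo;")     # HTML named entity, not predefined in XML
print(same_text, bare_ampersand_ok, html_entity_ok)
```

With this parser, the literal and escaped forms should yield identical
text, while a bare "&" and the HTML-only name &rsquo; are both rejected.
So at the XML level the literal characters are fine in a UTF-8 file, but
"&" and "<" still have to be escaped.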

Fixed, with the help of Sergei Gorelkin. (bug id 11881)

Michael.

