[fpc-devel] fpdoc and unicode characters

Marco van de Voort marcov at stack.nl
Thu Aug 14 13:14:38 CEST 2008

In our previous episode, Graeme Geldenhuys said:
> In researching how to type Unicode characters on different platforms,
> I came across an interesting argument regarding Unicode characters and
> HTML.  The argument might apply to fpdoc documentation (xml) files as
> well, hence the reason for this post.
> With W3C embracing UTF-8 as the de facto standard for HTML pages, do
> we still need to escape characters like ampersand ['  U+2019] to
> [&] etc.  Unicode has been around for some time now, so surely all
> half-decent software should be able to read and display the actual
> character correctly by now (sensitive subject for FPC and Delphi at
> the moment), instead of having to bother with the escaped version.

Yes. The escaping is not related to encoding; it is needed because HTML
uses & as a control character.
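
To make the distinction concrete, here is a minimal sketch in Python using
only the standard library. It assumes a made-up <descr> element purely for
illustration and says nothing about how fpdoc itself parses its files. It
shows that a literal U+2019 is accepted as ordinary text, while a bare & is
rejected because it is markup, whatever the encoding.

```python
import xml.etree.ElementTree as ET
from xml.sax.saxutils import escape


def parses(xml_text):
    """Return True if xml_text is well-formed XML, False otherwise."""
    try:
        ET.fromstring(xml_text)
        return True
    except ET.ParseError:
        return False


# U+2019 (right single quotation mark) is an ordinary character, so it
# can appear literally in the text. No entity is needed.
# The <descr> element name is only an illustration.
literal_quote = "<descr>It\u2019s fine</descr>"

# A bare & starts an entity reference, so it must be escaped no matter
# which encoding the file uses.
bare_amp = "<descr>Free Pascal & Delphi</descr>"
escaped_amp = "<descr>" + escape("Free Pascal & Delphi") + "</descr>"

print(parses(literal_quote))
print(parses(bare_amp))
print(parses(escaped_amp))
```

In other words, the encoding decides which characters can be stored
directly, while escaping is only about the few characters the markup
itself reserves (escape() above handles &, < and >).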

> How does this argument fit with XML, which also uses UTF-8 as the de
> facto standard encoding? And seeing that fpdoc uses XML for the
> documentation files, can I use the actual Unicode characters in my
> fpdoc documentation, or must I still stick with the (what now seems to
> be outdated) escaped method?

Depends. Is & a steering character in all of XML, or only the xhtml like

