[fpc-pascal] fpc-xml: SAX_HTML parsing "script" tags

dmitry boyarintsev skalogryz.lists at gmail.com
Sun May 16 21:52:09 CEST 2010


Hello

Is there any known way to parse html files with:
<script>
some script with  tricky symbols , like "< >" that ruins the parsing
</script> tag?

I've not found any better option but to re-implement THTMLReader
(copy-paste + modify) and run a custom parser to skip the content of
the script tag, and then go-on parsing html file.

Also, is there any reason for f parameter to be "var"?

procedure ReadHTMLFile(var ADoc: THTMLDocument; var f: TStream);
procedure ReadHTMLFragment(AParentNode: TDOMNode; var f: TStream);

thanks,
Dmitry



More information about the fpc-pascal mailing list