[fpc-pascal] fpc-xml: SAX_HTML parsing "script" tags
dmitry boyarintsev
skalogryz.lists at gmail.com
Sun May 16 21:52:09 CEST 2010
Hello
Is there any known way to parse HTML files that contain a script tag such as the following?
<script>
some script with tricky symbols, like "<" and ">", that ruin the parsing
</script>
I have not found any better option than to re-implement THTMLReader
(copy-paste + modify) and run a custom parser that skips the content of
the script tag, and then go on parsing the HTML file.
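An alternative to copying THTMLReader would be to pre-process the input and drop the script bodies before the parser ever sees them. The following is only a minimal sketch under stated assumptions: StripScriptBodies is a hypothetical helper (it is not part of fcl-xml), the whole file is assumed to fit in a string, and the tag matching is deliberately naive (it would also match a tag such as "<scripted>" and does not handle ">" inside an attribute value).

```pascal
program StripScriptDemo;

{$mode objfpc}{$H+}

uses
  SysUtils, StrUtils;

// Hypothetical helper, not part of fcl-xml: returns a copy of Html in which
// the body of every <script>...</script> element is removed. The opening and
// closing tags themselves are kept.
function StripScriptBodies(const Html: string): string;
var
  Lower: string;
  OpenPos, BodyStart, ClosePos, Cursor: SizeInt;
begin
  Result := '';
  // LowerCase only changes ASCII letters, so offsets match those in Html.
  Lower := LowerCase(Html);
  Cursor := 1;
  while True do
  begin
    OpenPos := PosEx('<script', Lower, Cursor);
    if OpenPos = 0 then
      Break;
    // Find the end of the opening tag (naive: first '>' after '<script').
    BodyStart := PosEx('>', Lower, OpenPos);
    if BodyStart = 0 then
      Break;
    Inc(BodyStart);
    ClosePos := PosEx('</script', Lower, BodyStart);
    if ClosePos = 0 then
      Break; // unterminated script: leave the rest untouched
    // Copy everything up to and including the opening tag, skip the body.
    Result := Result + Copy(Html, Cursor, BodyStart - Cursor);
    Cursor := ClosePos;
  end;
  // Append whatever is left after the last processed script element.
  Result := Result + Copy(Html, Cursor, Length(Html) - Cursor + 1);
end;

begin
  // prints: a<script></script>b
  WriteLn(StripScriptBodies('a<script>if (x < 1) {}</script>b'));
end.
```

The cleaned string could then be wrapped in a stream (for example a TStringStream held in a TStream variable, since f is a "var" parameter) and passed to ReadHTMLFile. The obvious cost is that the script text is lost, so this only helps when the scripts themselves are not needed.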
Also, is there any reason for the f parameter to be declared as "var"?
procedure ReadHTMLFile(var ADoc: THTMLDocument; var f: TStream);
procedure ReadHTMLFragment(AParentNode: TDOMNode; var f: TStream);
Thanks,
Dmitry