[fpc-pascal] XML DOM and HTML
Johannes Nohl
johannes.nohl at gmail.com
Sat Jun 7 23:37:33 CEST 2008
Dear list,
I player around with the units dom and xmlread. I liked them very
much. Now I thought I could parse websites with it. But they are
slightly different as far as I know. In xml everthing is within a node
while in HTML there are more then one value in a node. E.g.:
possible XML:
<div>
asdf1
<span>qwer1</span>
<span>qwer2</span>
</div>
HTML:
<div>
asdf1
<span>qwer1</span>
asdf2
<span>qwer2</span>
asdf3
</div>
Using XML-Dom I can access Value "asdf1" only. I think second example
is not valid XML, or?
Has anybody used XML to parse HTML-files? Is there a unit?
Thanks four your help!
More information about the fpc-pascal
mailing list