[fpc-pascal] Re: html scrapping library
joshyfun at gmail.com
Tue Feb 15 19:20:17 CET 2011
Tuesday, February 15, 2011, 6:08:29 PM, you wrote:
i> I have a spelling error, it's HTML scraping, it means to read HTML (and few
i> other contents on the web), and extract information using for example CSS
i> elements (that is #id .class:first) will brings the first data inside of
i> "id" of the element with the "class" content.
i> Another way is to extract information using XPath (for xml).
i> At the end it's a data mining for specific type of information.
Almost all scrapers I had used in the past were built using regular
expressions, so the regexp engine should be more than enought.
More information about the fpc-pascal