[fpc-pascal] Re: html scrapping library

José Mejuto joshyfun at gmail.com
Tue Feb 15 19:20:17 CET 2011


Hello FPC-Pascal,

Tuesday, February 15, 2011, 6:08:29 PM, you wrote:

i> I made a spelling error: it's HTML scraping. It means reading HTML (and a few
i> other kinds of web content) and extracting information using, for example, CSS
i> selectors (that is, "#id .class:first" returns the first item with the class
i> "class" inside the element whose id is "id").
i> Another way is to extract information using XPath (for XML).
i> In the end it's data mining for a specific type of information.

Almost all the scrapers I have used in the past were built on regular
expressions, so the regexp engine should be more than enough.
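
To make the regexp approach concrete, here is a minimal sketch in Python
(chosen only for brevity; the same pattern should carry over to any regexp
engine). The sample HTML, the class name "item" and the function name
scrape_items are all made up for illustration. Note that a regexp does not
understand nesting, so this only works when the markup is regular enough.

```python
import re

# Hypothetical sample page; a real scraper would download this text first.
HTML = """
<div id="prices">
  <span class="item">Apples</span>
  <span class="item">Pears</span>
</div>
"""

# Capture the text of every <span class="item">...</span>.
# The non-greedy (.*?) stops at the first closing </span>.
ITEM_RE = re.compile(
    r'<span\s+class="item"\s*>(.*?)</span>',
    re.IGNORECASE | re.DOTALL,
)

def scrape_items(html):
    """Return the text of each matching span, in document order."""
    return [text.strip() for text in ITEM_RE.findall(html)]

items = scrape_items(HTML)
# Rough equivalent of ".item:first": take the first match, if any.
first = items[0] if items else None
print(first)
```

Taking the first element of the result list plays the role of ":first" in the
CSS example quoted above; the whole list is there if all matches are wanted.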

-- 
Best regards,
 José



