[fpc-pascal] Re: html link extractor
L at z505.com
Tue Jul 3 10:11:14 CEST 2007
> > Is there a unit somewhere that can extract links from html pages? I want
> > to be able to recursively add pages to a chm archive.
> I created a program called GetLinks in a couple minutes:
Updated the files and changed the htmlutil functions a bit.
Also, created a recursive example that uses Synapse.. and grabs web links
*infinitely* until it finds no more links (using nested pascal function for the
recursion, Torvalds hates those).
The recursive demo probably won't work with file:// style links since it invokes
Synapse, and only simple http relative paths work as is. (not sure if CHM files
use file:// style links, guessing).
Latest download includes recursive extractor and getlinks demo:
More information about the fpc-pascal