[fpc-pascal] stripping HTML

Roland Schäfer roland.schaefer at fu-berlin.de
Sat Apr 16 14:12:36 CEST 2011


Hello everyone,
is there any existing FPC code (even external libraries with bindings)
to strip HTML tags from files, including adequate removal of scripts,
comments and other multi-line non-text - and which handles faulty HTML
input in a tolerant fashion? I also need to keep track of how many
characters per line were removed. Thanks in advance.
Regards - Roland



More information about the fpc-pascal mailing list