[fpc-pascal] PDF indexing
Marc Santhoff
M.Santhoff at web.de
Tue Jun 23 05:07:17 CEST 2015
On So, 2015-06-21 at 00:33 +0200, Michael Van Canneyt wrote:
>
> On Sat, 20 Jun 2015, Marc Santhoff wrote:
>
> > Hi,
> >
> > does fpc (or lazarus) have a helper class for indexing the content of
> > PDF files?
>
> check packages/fpindexer
>
> I have used it to create full text searches on a database.
> You should be able to adapt the base code to create an index of a PDF.
That looks pretty intresting. And it has some docs, wow.
If I understand correctly I'd only have to implement a class TIReaderPDF
and the difference to simple text reading is the part that extracts a
text stream or the text parts of the stream rejecting the pdf commands
(if they are in there, need to look at PowerPDF).
Marc
--
Marc Santhoff <M.Santhoff at web.de>
More information about the fpc-pascal
mailing list