[fpc-pascal] PDF indexing

Marc Santhoff M.Santhoff at web.de
Tue Jun 23 05:07:17 CEST 2015


On So, 2015-06-21 at 00:33 +0200, Michael Van Canneyt wrote:
> 
> On Sat, 20 Jun 2015, Marc Santhoff wrote:
> 
> > Hi,
> >
> > does fpc (or lazarus) have a helper class for indexing the content of
> > PDF files?
> 
> check packages/fpindexer
> 
> I have used it to create full text searches on a database. 
> You should be able to adapt the base code to create an index of a PDF.

That looks pretty intresting. And it has some docs, wow.

If I understand correctly I'd only have to implement a class TIReaderPDF
and the difference to simple text reading is the part that extracts a
text stream or the text parts of the stream rejecting the pdf commands
(if they are in there, need to look at PowerPDF).

Marc

-- 
Marc Santhoff <M.Santhoff at web.de>




More information about the fpc-pascal mailing list