[fpc-pascal] PDF indexing

Michael Van Canneyt michael at freepascal.org
Tue Jun 23 09:10:21 CEST 2015



On Tue, 23 Jun 2015, Marc Santhoff wrote:

> On So, 2015-06-21 at 00:33 +0200, Michael Van Canneyt wrote:
>>
>> On Sat, 20 Jun 2015, Marc Santhoff wrote:
>>
>>> Hi,
>>>
>>> does fpc (or lazarus) have a helper class for indexing the content of
>>> PDF files?
>>
>> check packages/fpindexer
>>
>> I have used it to create full text searches on a database.
>> You should be able to adapt the base code to create an index of a PDF.
>
> That looks pretty intresting. And it has some docs, wow.
>
> If I understand correctly I'd only have to implement a class TIReaderPDF
> and the difference to simple text reading is the part that extracts a
> text stream or the text parts of the stream rejecting the pdf commands
> (if they are in there, need to look at PowerPDF).

Yes, that would be correct.

Michael.



More information about the fpc-pascal mailing list