<html>
<head>
<meta content="text/html; charset=windows-1252"
http-equiv="Content-Type">
</head>
<body bgcolor="#FFFFFF" text="#000000">
<div class="moz-cite-prefix">On 24/06/2015 12:00,
<a class="moz-txt-link-abbreviated" href="mailto:fpc-pascal-request@lists.freepascal.org">fpc-pascal-request@lists.freepascal.org</a> wrote:<br>
</div>
<blockquote
cite="mid:mailman.1.1435140002.2644.fpc-pascal@lists.freepascal.org"
type="cite">
<pre wrap="">Date: Wed, 24 Jun 2015 08:47:20 +0200
From: Marc Santhoff <a moz-do-not-send="true" class="moz-txt-link-rfc2396E" href="mailto:M.Santhoff@web.de"><M.Santhoff@web.de></a>
To: <a moz-do-not-send="true" class="moz-txt-link-abbreviated" href="mailto:fpc-pascal@lists.freepascal.org">fpc-pascal@lists.freepascal.org</a>
Subject: Re: [fpc-pascal] PDF indexing
Message-ID: <a moz-do-not-send="true" class="moz-txt-link-rfc2396E" href="mailto:1435128440.11596.3.camel@puma.das.netz"><1435128440.11596.3.camel@puma.das.netz></a>
Content-Type: text/plain; charset="ISO-8859-15"
<span class="moz-txt-citetags">> > </span>Currently I'm searching a pdf access library that could help doing so.
<span class="moz-txt-citetags">> > </span>The only one halfway fitting up to now is this one:
<span class="moz-txt-citetags">> ></span>
<span class="moz-txt-citetags">> > </span><a moz-do-not-send="true" class="moz-txt-link-freetext" href="http://itextpdf.com/functionality">http://itextpdf.com/functionality</a>
<span class="moz-txt-citetags">> ></span>
<span class="moz-txt-citetags">> > </span>Open Source but a license similar to LGPL without exception. Still
<span class="moz-txt-citetags">> > </span>searching ...
</pre>
<blockquote type="cite" style="color: #000000;">
<pre wrap=""><span class="moz-txt-citetags">> </span>
<span class="moz-txt-citetags">> </span>But Java or .Net.
</pre>
</blockquote>
<pre wrap="">Really? Oops, oversight. I thought there would be a C version.
</pre>
<blockquote type="cite" style="color: #000000;">
<pre wrap=""><span class="moz-txt-citetags">> </span>Depending on your platform you may attempt gnostice products. It's Delphi code,
<span class="moz-txt-citetags">> </span>but they are quite open and I was told the upcoming rework of their products
<span class="moz-txt-citetags">> </span>will make support for Lazarus possible.
</pre>
</blockquote>
<pre wrap="">I'll have a look at that one. Pure Object Pascal would be very nice.
And I found poppler, which is a fork of xpdf compiling into a library.
Part of Gnome, so LGPL.
Another solution would be to use TProcess and an external tool, there
are some "pdf2txt".variants out there.
Thanks again,
Marc
<div class="moz-txt-sig">--
Marc Santhoff <a moz-do-not-send="true" class="moz-txt-link-rfc2396E" href="mailto:M.Santhoff@web.de"><M.Santhoff@web.de></a>
</div></pre>
</blockquote>
Marc,<br>
<br>
Take a look on Xpdf project<br>
"<font face="sans-serif">The Xpdf project also includes a
PDF text extractor</font>"<br>
<a class="moz-txt-link-freetext" href="http://www.foolabs.com/xpdf/about.html">http://www.foolabs.com/xpdf/about.html</a><br>
<br>
<br>
A+<br>
Serguei<br>
</body>
</html>