<?xml version="1.0"?><!-- generator="bbPress" -->

<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
>

<channel>
<title>Forum Topic: xpdf integration</title>
<link>http://www.kbpublisher.com/forums/</link>
<description>Forum Topic: xpdf integration</description>
<language>en</language>
<pubDate>Fri, 10 Sep 2010 14:29:30 +0000</pubDate>

<item>
<title>rocket2009 on "xpdf integration"</title>
<link>http://www.kbpublisher.com/forums/topic/xpdf-integration#post-338</link>
<pubDate>Fri, 01 May 2009 05:34:45 +0000</pubDate>
<dc:creator>rocket2009</dc:creator>
<guid isPermaLink="false">338@http://www.kbpublisher.com/forums/</guid>
<description>&#60;p&#62;I had your team install my system, and I was the test case for xpdf. I have uploaded several PDF files, and they were converted to text by xpdf: I can see the &#34;yes&#34; and click on it.&#60;/p&#62;
&#60;p&#62;My problem is that I can't seem to create a search that finds any of the articles. I have tried small words, long words, and multiple words. I have gone into advanced search and selected attachments and inline files, and I have selected all categories, both with the &#34;all&#34; button and by highlighting all of them.&#60;/p&#62;
&#60;p&#62;Search works on articles, but I can't seem to get it to work reliably on attachments. Any hints?
&#60;/p&#62;</description>
</item>
<item>
<title>onesign on "xpdf integration"</title>
<link>http://www.kbpublisher.com/forums/topic/xpdf-integration#post-303</link>
<pubDate>Tue, 02 Dec 2008 16:22:45 +0000</pubDate>
<dc:creator>onesign</dc:creator>
<guid isPermaLink="false">303@http://www.kbpublisher.com/forums/</guid>
<description>&#60;p&#62;It runs when you add or update a file, so there is no need to wait. Files added before xpdf was enabled have never been indexed; you have to update them.
&#60;/p&#62;</description>
</item>
<item>
<title>cnielsen on "xpdf integration"</title>
<link>http://www.kbpublisher.com/forums/topic/xpdf-integration#post-302</link>
<pubDate>Fri, 28 Nov 2008 21:15:57 +0000</pubDate>
<dc:creator>cnielsen</dc:creator>
<guid isPermaLink="false">302@http://www.kbpublisher.com/forums/</guid>
<description>&#60;p&#62;Ahh, now I see how it works :-) Sorry, this is not part of my daily business...&#60;br /&#62;
And how long do I have to wait until the MySQL fulltext index runs? Every hour, or when?
&#60;/p&#62;</description>
</item>
<item>
<title>onesign on "xpdf integration"</title>
<link>http://www.kbpublisher.com/forums/topic/xpdf-integration#post-301</link>
<pubDate>Fri, 28 Nov 2008 20:17:08 +0000</pubDate>
<dc:creator>onesign</dc:creator>
<guid isPermaLink="false">301@http://www.kbpublisher.com/forums/</guid>
<description>&#60;p&#62;Text from the PDF file will be extracted to the database, where it will be indexed by the MySQL fulltext index.&#60;br /&#62;
The text is extracted when you add the file.&#60;br /&#62;
There is a &#34;Text&#34; field in the files listing; if extraction was successful, you will be able to see the extracted text there.
&#60;/p&#62;</description>
</item>
<item>
<title>cnielsen on "xpdf integration"</title>
<link>http://www.kbpublisher.com/forums/topic/xpdf-integration#post-300</link>
<pubDate>Fri, 28 Nov 2008 15:07:05 +0000</pubDate>
<dc:creator>cnielsen</dc:creator>
<guid isPermaLink="false">300@http://www.kbpublisher.com/forums/</guid>
<description>&#60;p&#62;So when I upload a new PDF file, KBP will extract the PDF to text, and then I'll have two files in my kb_file folder, one called test.pdf and one called test.txt. Is this correct? And how long do I have to wait for the new files to be indexed?
&#60;/p&#62;</description>
</item>
<item>
<title>onesign on "xpdf integration"</title>
<link>http://www.kbpublisher.com/forums/topic/xpdf-integration#post-299</link>
<pubDate>Thu, 27 Nov 2008 18:50:02 +0000</pubDate>
<dc:creator>onesign</dc:creator>
<guid isPermaLink="false">299@http://www.kbpublisher.com/forums/</guid>
<description>&#60;p&#62;This tool does not search in files; it extracts raw text from PDF files, and KBP indexes that text.&#60;br /&#62;
So search will be possible only for newly uploaded files (added after xpdf installation).&#60;br /&#62;
In the next KBPublisher release it will be possible to reindex existing files.&#60;/p&#62;
&#60;p&#62;Test it with PHP and the real path:&#60;br /&#62;
php system('/usr/path_to_xpdf/pdftotext -raw file_read.pdf file_write.txt', $return);&#60;/p&#62;
&#60;p&#62;Also try setting $file_conf['extract_tool']['pdf'] = '';
&#60;/p&#62;</description>
</item>
<item>
<title>cnielsen on "xpdf integration"</title>
<link>http://www.kbpublisher.com/forums/topic/xpdf-integration#post-298</link>
<pubDate>Thu, 27 Nov 2008 15:26:48 +0000</pubDate>
<dc:creator>cnielsen</dc:creator>
<guid isPermaLink="false">298@http://www.kbpublisher.com/forums/</guid>
<description>&#60;p&#62;Sorry for the misunderstanding. No, it doesn't work for me in my KBPublisher installation. I can convert PDFs to text files at the command prompt, but I cannot search in PDF documents attached to KB articles.
&#60;/p&#62;</description>
</item>
<item>
<title>onesign on "xpdf integration"</title>
<link>http://www.kbpublisher.com/forums/topic/xpdf-integration#post-297</link>
<pubDate>Wed, 26 Nov 2008 15:44:20 +0000</pubDate>
<dc:creator>onesign</dc:creator>
<guid isPermaLink="false">297@http://www.kbpublisher.com/forums/</guid>
<description>&#60;p&#62;Sorry, what is the question? Does it work for you?
&#60;/p&#62;</description>
</item>
<item>
<title>cnielsen on "xpdf integration"</title>
<link>http://www.kbpublisher.com/forums/topic/xpdf-integration#post-296</link>
<pubDate>Tue, 25 Nov 2008 12:04:30 +0000</pubDate>
<dc:creator>cnielsen</dc:creator>
<guid isPermaLink="false">296@http://www.kbpublisher.com/forums/</guid>
<description>&#60;p&#62;I've integrated xpdf for searching in pdf files. The command &#34;pdftotext -raw example.pdf example.txt&#34; works fine. Here's my config.inc.php:&#60;/p&#62;
&#60;p&#62;&#38;lt;?php&#60;br /&#62;
$win = (substr(PHP_OS, 0, 3) == &#34;WIN&#34;);&#60;/p&#62;
&#60;p&#62;// change this if you install xpdf to other directory&#60;br /&#62;
$file_conf['extract_tool']['pdf'] = ($win) ? APP_EXTRA_MODULE_DIR . 'file_extractors/xpdf/win/'&#60;br /&#62;
                                           : '/usr/local/groundwork/apache2/htdocs/kb/admin/extra/file_extractors/xpdf/win/';&#60;br /&#62;
?&#38;gt;&#60;/p&#62;
&#60;p&#62;Has anyone got this running in their environment?&#60;br /&#62;
Thanks!
&#60;/p&#62;</description>
</item>

</channel>
</rss>
