1.

Solve : Rearching multiple PDF files?

Answer»

Hello,
I don't have/use Acrobat, but I have lots of pdf files that I have collected.
Many of them are books, History ETC, that I would like to search.
Rather than searching individualy, I'm wondering if there is a Search tool (shareware or something) that will search all of the files at once ?

I'm sure there is; so my questions are these;
1) what is a good one, (good = simple yet robust, easy to set up maintain....)
2) are there any that will do text and text as image, OCR
3) needs to work across many vintages of pdf files

I'm on Vista, and use Reader v10You might try this: http://www.adobe.com/support/downloads/detail.jsp?ftpID=2611 I have not used it, so I can't COMMENT on its effectiveness. The article mentioned that it started with Reade 7, since I'm on 10 I assume it's bundled in there ??
Is there a way to confirm ?

My biggest issue is with the text as image and some sort of OCR based search....Any thing on the OCR part ?I guess there is no "hope" Quote from: cowboy1611 on October 28, 2011, 07:17:38 AM

I guess there is no "hope"
Right, maybe no hope. Here's something that came up from a Google search on ocr searchable pdf: http://www.scantopdf.eu/scantopdf-plugins-ocr.html Don't know whether it could be a viable solution or not. I think you'll have to dig into this deeper yourself. It not something I've ever delved into during my many years of computing and I suspect this is true for a good many other forum members, too. If you think about what it is that you are asking to HAPPEN you would conclude there is no App that can do it...1.Install Acrobat reader
2.use Windows Search, Google desktop, or another desktop search application
3.Profit!

'IFilters' are implementations of a well-defined interface that has been around since the advent of Windows Desktop Search, and they are used by most desktop search applications, including Google desktop. Basically, they allow the search tool to search the content of what would otherwise be a BINARY file, such as a word document, an Excel spreadsheet, or, in this CASE, a PDF document.

Note that in order for this to work, the PDF must actually be text; I have seen instances where a PDF is actually comprised of images that are embedded in a PDF, which sort of defeats the purpose, and won't work with IFilter's, either.

Adobe has packaged the IFilter implementation in Reader for as long as I can remember.


Discussion

No Comment Found