Question

I have a large amount of PDFs and I need to be able to search them in MOSS 2007. I am aware of the iFilter which is required, but it will not index scanned PDFs. What is the fastest way to handle this problem?

Was it helpful?

Solution

Image PDFs do behave differently than those created from word docs. You can probably get some OCR tools to help. Its been awhile, but I think KnowledgeLake had the ability to OCR and auto-index some of the scanned documents.

OTHER TIPS

Well, I heard that ABBYY FineReader 10 is a very good software to do this (I never tried it). It will probably include some DEV work on your side to automate this.

You can use Acrobat Pro. It contains an OCR Text Recognition batch command that will convert entire folders.

Licensed under: CC-BY-SA with attribution
Not affiliated with sharepoint.stackexchange
scroll top