What's the best tool to use to convert scanned PDFs so they can be indexed in SharePoint?

https://sharepoint.stackexchange.com/questions/831

search
pdf
ifilter
2007
crawling

16-10-2019
|

Question

I have a large amount of PDFs and I need to be able to search them in MOSS 2007. I am aware of the iFilter which is required, but it will not index scanned PDFs. What is the fastest way to handle this problem?

Solution

Image PDFs do behave differently than those created from word docs. You can probably get some OCR tools to help. Its been awhile, but I think KnowledgeLake had the ability to OCR and auto-index some of the scanned documents.

OTHER TIPS

Well, I heard that ABBYY FineReader 10 is a very good software to do this (I never tried it). It will probably include some DEV work on your side to automate this.

You can use Acrobat Pro. It contains an OCR Text Recognition batch command that will convert entire folders.

Licensed under: CC-BY-SA with attribution

Not affiliated with sharepoint.stackexchange