Question

I'm using Tesseract in one of my applications, and I'm having trouble obtaining perfect results. When I use the library on a perfect screenshot, it gets most of the text and that's fine. But when I use it on a photo for example, it has trouble working it out, and sometimes it won't even return anything.

I know you can train Tesseract, is there a good training tutorial or something for iPhone photo recognition? Or are there any modifications I should make to my source image before processing it?

Thanks!

Was it helpful?

Solution

Post a sample pic. Here are some things to look at:

How are you getting the image to 1-bit? Ultimately OCR is done in B/W images -- perhaps you can do a better job than the library.

What is the DPI of the image? Scans usually are around 200-300 DPI, but photos can be 72 or 96. OCR engines look at DPI to get a sense of the size of the letters. You might need to change the DPI and resample (resize) the image so that the DPI matches expectations. OCR engines are expecting scanned documents, not photographs.

Licensed under: CC-BY-SA with attribution
Not affiliated with StackOverflow
scroll top