Suggestions about how to mine information on the internet and extract text out of an image [closed]

StackOverflow https://stackoverflow.com/questions/19028252

  •  29-06-2022
  •  | 
  •  

문제

I'm interested in how I can mine information on the internet and how to extract text out of an image.

So I'm searching for information on how to do this, I would like to program this on my own. Are there any papers that gives me a good explanation about mining and extracting?

Can someone help me on the way?

Kind regards,

도움이 되었습니까?

해결책

You can take a look at Tess4J which is a java wrapper for Tesseract. That being said, image processing (text extraction) usually requires some pre-processing first, removing colours and sections which you know contain no text being some of the most common.

라이센스 : CC-BY-SA ~와 함께 속성
제휴하지 않습니다 StackOverflow
scroll top