how to parse a lot of PDFs
-
24-09-2019 - |
문제
I have a ton of PDFs I want to be able to parse sentence-by-sentence. Is there a tool for MySQL (or some other database system) for converting PDFs into mysql, and then reading out sentences one at a time? Is there some other tool to do this? I imagined loading all the pdfs into a DB and then reading would be the fastest way but I don't really know...
해결책
try pdftotext. Then insert it into DB.
제휴하지 않습니다 StackOverflow