Indexing pdf documents

문제

What the best way to index pdf documents? Should I index them by converting pdf documents to txt or there is a better way to index pdf files?

해결책

Assuming you're talking about solr: see the ExtractingRequestHandler.

라이센스 : CC-BY-SA ~와 함께 속성

제휴하지 않습니다 StackOverflow