문제

What the best way to index pdf documents? Should I index them by converting pdf documents to txt or there is a better way to index pdf files?

도움이 되었습니까?

해결책

Assuming you're talking about solr: see the ExtractingRequestHandler.

라이센스 : CC-BY-SA ~와 함께 속성
제휴하지 않습니다 StackOverflow
scroll top