How to configure Apache Tika with apache Solr 1.4.1

https://stackoverflow.com/questions/3863906

solr
solrnet
solr-cell
apache-tika

27-09-2019
|

Question

I want to index a large number of pdf documents.

I have found a reference showing that it could be done using Apache Tika but unfortunately I cannot find any reference that describes I could configure Apache Tika in Solr 1.4.1.

Once configured I do have it configured, how can I send documents to Solr directly without using curl?

I am using solrnet for indexing.

Solution

See ExtractingRequestHandler

OTHER TIPS

Support for ExtractingRequestHandler in SolrNet is not yet complete. You can either finish implementing it, or work around it and craft your own HttpWebRequests.

Licensed under: CC-BY-SA with attribution

Not affiliated with StackOverflow