Question

I want to index a large number of PDF documents.

I have found a reference showing that this can be done using Apache Tika, but unfortunately I cannot find any reference that describes how to configure Apache Tika in Solr 1.4.1.

Once I have it configured, how can I send documents to Solr directly without using curl?

I am using SolrNet for indexing.


OTHER TIPS

Support for ExtractingRequestHandler in SolrNet is not yet complete. You can either finish implementing it, or work around it and craft your own HttpWebRequests.
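
As a rough illustration of the "craft your own request" route, here is a minimal sketch in Python (standard library only) of what such a request could look like; the same shape would apply to an HttpWebRequest in .NET. It assumes the ExtractingRequestHandler is mapped to /update/extract in solrconfig.xml, that your schema's unique key field is named id, and that Solr runs at a URL such as http://localhost:8983/solr. The function names build_extract_request and index_pdf are made up for this example.

```python
import urllib.parse
import urllib.request


def build_extract_request(solr_url, doc_id, pdf_bytes, commit=True):
    """Build a POST request that sends raw PDF bytes to Solr's extract handler.

    Assumes the handler is registered at /update/extract and that the
    unique key field is called "id" (hence the literal.id parameter).
    """
    params = {"literal.id": doc_id}
    if commit:
        # Committing after every document is slow for bulk loads;
        # pass commit=False and commit once at the end instead.
        params["commit"] = "true"
    url = solr_url.rstrip("/") + "/update/extract?" + urllib.parse.urlencode(params)
    return urllib.request.Request(
        url,
        data=pdf_bytes,
        headers={"Content-Type": "application/pdf"},
        method="POST",
    )


def index_pdf(solr_url, doc_id, path):
    """Read a PDF from disk, post it to Solr, and return the HTTP status code."""
    with open(path, "rb") as f:
        request = build_extract_request(solr_url, doc_id, f.read())
    with urllib.request.urlopen(request) as response:
        return response.status
```

Calling index_pdf("http://localhost:8983/solr", "doc1", "report.pdf") would post one file. For a large batch, building the requests with commit=False and issuing a single commit afterwards is likely to be noticeably faster.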

Licensed under: CC-BY-SA with attribution