Indexing documents with websolr

https://stackoverflow.com/questions/15051264

lucene
ruby-on-rails
solr
heroku
websolr

11-03-2022
|

문제

We're looking at using the Websolr add-on for searching Resources within our Rails app.

The app contains many Resource models. Most of the resource models are self-contained, with a series of attributes: author, title, a set of tags etc, however some of the Resource models have a pdf attached. We need to index the content of this pdf so that it is searchable as a part of the Resource.

How should I approach this?

해결책

This should get you started: ExtractingRequestHandler, which integrates Tika into Solr.

라이센스 : CC-BY-SA ~와 함께 속성

제휴하지 않습니다 StackOverflow