Question

I'd like to use Solr as the client-side search engine for published Tridion content. It'll probably be done as a RESTful service that is disconnected from the main application.

As we'll almost certainly be using boilerplate DD4T, where everything is published to the Broker, I have some concerns, particularly about indexing binaries such as PDF or Word files. It sounds like that could put an awful lot of strain on the DB.

What strategy is recommended for retrieving binaries and indexing them in this way? It sounds like it will be more difficult than if the binaries were stored outside the DB.

Solution

We decided to publish binaries to the filesystem. You configure this in cd_storage_conf.xml with something like this:

<Publication Id="9" defaultStorageId="defaultdb" cached="true">
    <Item typeMapping="Binary" storageId="defaultFile" cached="true"/>
</Publication>

However, even if you do choose to publish binaries to the database, this should not impact your Solr index, which will be completely separate from the Broker database. You will need to write something custom (a custom deployer, perhaps) that pushes your data into your Solr index, and you can choose to ignore binaries when doing so.
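To make the "push your data into Solr and ignore binaries" idea concrete, here is a minimal sketch of the Solr side only. It is not Tridion or DD4T code: the item dictionaries, the field names (id, title, url), the core name "tridion", the host and the list of extensions are all assumptions for illustration, and your custom deployer would have to supply the real values. It relies only on Solr's standard /update handler accepting a JSON array of documents.

```python
import json
import urllib.request

# Assumption: a local Solr instance with a core named "tridion".
SOLR_UPDATE_URL = "http://localhost:8983/solr/tridion/update?commit=true"

# Assumption: extensions that identify binaries you do not want to index.
BINARY_EXTENSIONS = (".pdf", ".doc", ".docx")


def is_binary(url):
    """Return True when the published URL ends in a known binary extension."""
    return url.lower().endswith(BINARY_EXTENSIONS)


def build_update_payload(items):
    """Build a Solr JSON update body from published items, skipping binaries.

    Each item is a hypothetical dict with "id", "title" and "url" keys.
    """
    docs = [
        {"id": item["id"], "title": item["title"], "url": item["url"]}
        for item in items
        if not is_binary(item["url"])
    ]
    return json.dumps(docs)


def push_to_solr(items, update_url=SOLR_UPDATE_URL):
    """POST the documents to Solr's update handler and return the HTTP status."""
    request = urllib.request.Request(
        update_url,
        data=build_update_payload(items).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )
    with urllib.request.urlopen(request) as response:
        return response.status
```

Because the index is fed by this separate push, nothing here reads from the Broker database at query time, which is why where the binaries are stored does not affect Solr. If you later want binaries indexed as well, the filter in build_update_payload is the place to change.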

Licensed under: CC-BY-SA with attribution