What architecture would you use to store 10 billion MIME messages and make it deletable and full text searchable incl. attachments

StackOverflow https://stackoverflow.com/questions/8078923

Question

I would like to use components that are free for commercial use.

I looked at a Lucene and MongoDB combo but wonder if there are better approaches, ideally a single system.

Was it helpful?

Solution

Sphinx can also handle billions of documents http://sphinxsearch.com/info/powered/

(although I also use Lucene and cannot tell whether Sphinx is better)

Licensed under: CC-BY-SA with attribution
Not affiliated with StackOverflow
scroll top