Question

I currently have a number of databases in a hadoop cluster and wish to index some tables from these databases into a Solr index for searching. Is there a way this can be done? Or is there some mechanism to perform this kind of search in hadoop itself?

Was it helpful?

Solution

Check out : http://katta.sourceforge.net/

This is integration of Hadoop / Lucene for distributed index and shards.

OTHER TIPS

You can use hadoop itself. However if you are performing various regular expression search, then solr is a very good option. Are you use hive or hbase in hadoop to store your database, or are you storing in flat file?

Licensed under: CC-BY-SA with attribution
Not affiliated with StackOverflow
scroll top