Question

There appears to be a myriad of NoSQL databases available these days:

  • CouchDB
  • MongoDB
  • Cassandra
  • Hadoop

There's also a boundary between these tools and tools such as Redis that work as a memcached replacement.

Without hand waving and throwing too many buzz words - my question is the following:

How does one intelligently decide which tool here makes the most sense for their project? Are the projects similar enough to where the answer to this is subjective, eg: Ruby is better than Python or Python is better than Ruby? Or are we talking Apples and oranges here in that they each of them solve different problems?

What's the best way to educate myself on this new trend?

Was it helpful?

Solution

Perhaps one way to think of it is, programming has recently evolved from using one general-purpose language for everything to using the general-purpose language for most things, plus domain-specific languages for the more appropriate parts. For example, you might use Lua to script artificial intelligence of a character in a game.

NoSQL databases might be similar. SQL is the general purpose database with the longest and broadest adoption. While it could be shoehorned to serve many tasks, programmers are beginning to use NoSQL as a domain-specific database when it is more appropriate.

OTHER TIPS

I would argue, that the 4 major players you named do have quite different featuresets and try to solve different problems with different priority.

For instance, as far as i know Cassandra (and i assume Hadoop) central focus is on large scale installations.

MongoDb tries to be a better scaling alternative to classic SQL servers in providing comparably powerful query functions.

CouchDB's focus is comparably small scale (will not shard at all, "only" replicate), high durability and easy synchronization of data.

You might want to check out http://nosql-database.org/ for some more information.

I am facing pretty much the same problem as you, and i would say there is no real alternative to look at all solutions in detail.

Check out this site: http://cattell.net/datastores/ and in particular the PDF linked at the bottom (CACM Paper). The latter contains an excellent discussion of the relative merits of various data store solutions.

It's easy. NoSQL databases are ACID compliant databases minus some guarantees. So just decide which guarantees you can do without and find the database that fits. If you don't need durability for example, maybe redis is best. Or if you don't need multi-record transactions, then perhaps look into mongodb.

Licensed under: CC-BY-SA with attribution
Not affiliated with StackOverflow
scroll top