Question

I'm looking for the database (python api) for tasks described below.

There is an increasing amount of data. In the initial period of accumulation of the data will be simple analysis that can be done with simple sql queries. But in the future it is planned to extract data from more complex queries, finding complex relationships. Requires initially choose a data storage system, which in future will allow to analyze this database different, things get complicated ( as you explore topics and skills development ) tools.

Example:

First, there is only data on buckwheat and rice. Required to compare sales growth during the month. No problem - two sql query by product name, limited sampling time (month). Drawn graphs, clearly see what is what. There are more kinds of goods. Now it took to learn how to depend on sales growth Soya sauce from rice sales growth. This is somehow possible from using sql query. And now we have 5000 names in the database and requires using some algorithms (eg neural networks) seek any dependencies in the database automatically. That is, start with simple, needs grow, become more complicated tools. What db suited to increasing requirements, being a simple enough for use in the beginning? Is Redis, for example?

I would have been very useful to know that in my question incorrectly. I am totally new to this subject. it tells me what to look

Was it helpful?

Solution

I agree, MongoDB is suited for that. If you had millions of entries with multiple relations, SQL would be ahead. But for some thousands of entries a document-based DB does the job. As a benefit you don't have to care about the structure of your DB before you create it and are able to easily change it later. Take a look at the PyMongo Tutorial.

Licensed under: CC-BY-SA with attribution
Not affiliated with StackOverflow
scroll top