Question

I basically have one gigantic table (about 1,000,000,000,000 records) in a database, with these fields:

id, block_id, record

id is unique; block_id is not unique. There are at most about 10k rows sharing the same block_id, each with a different record.

To simplify my job that deals with the DB I have an API similar to this:

Engine e = new Engine(...);
// this method must be thread safe, but with fine-grained locking (per block_id) to improve concurrency
e.add(block_id, "asdf"); // a record is up to 1 KB max

// this must concatenate all records already added for block_id; the result won't be bigger than 10 MB (worst case), average < 5 MB
String s = e.getConcatenatedRecords(block_id);
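
To make the fine-grained locking concrete, this is roughly what I mean; a sketch only, with appendToBlock as a stand-in for whatever the backing store does:

import java.util.concurrent.ConcurrentHashMap;

public class Engine {
    // one lock object per block_id, so writers to different blocks never contend
    private final ConcurrentHashMap<Long, Object> locks = new ConcurrentHashMap<>();

    public void add(long blockId, String record) {
        Object lock = locks.computeIfAbsent(blockId, k -> new Object());
        synchronized (lock) {
            appendToBlock(blockId, record);
        }
    }

    // stand-in for the actual storage (PostgreSQL, flat file, ...)
    private void appendToBlock(long blockId, String record) {
    }
}

With hundreds of millions of distinct block_ids this map grows without bound; a fixed array of N locks indexed by block_id % N (lock striping) would bound the memory instead.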

If I map each block to a file (I haven't done it yet), then each record will be a line in that file and I will still be able to use the same API.
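
Roughly what I have in mind for the file mapping, as a sketch (one file per block_id under a data directory; all names invented, and add must still run under the per-block lock above):

import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.StandardOpenOption;

public class FileEngine {
    private final Path dir; // data directory, must already exist

    public FileEngine(Path dir) {
        this.dir = dir;
    }

    // append one record as one line in the block's file
    public void add(long blockId, String record) throws IOException {
        Path file = dir.resolve(Long.toString(blockId));
        Files.write(file, (record + "\n").getBytes(StandardCharsets.UTF_8),
                StandardOpenOption.CREATE, StandardOpenOption.APPEND);
    }

    // one sequential read of at most ~10 MB
    public String getConcatenatedRecords(long blockId) throws IOException {
        Path file = dir.resolve(Long.toString(blockId));
        return Files.exists(file)
                ? new String(Files.readAllBytes(file), StandardCharsets.UTF_8)
                : "";
    }
}

One caveat I already see: with ~1,000,000,000,000 records and at most 10k per block, that is at least 100 million block files, so they would have to be sharded into subdirectories.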

But I want to know: will I get any performance gain by using flat files compared to a well-tuned PostgreSQL database, at least for this specific scenario?

My biggest requirement, though, is that getConcatenatedRecords returns stupidly fast (the add operation is not as critical). I am also considering caching and memory mapping; I just don't want to complicate things before asking whether there is an already-made solution for this kind of scenario.


Solution 3

After some research, I found that a family of embeddable data stores covers most of my use cases.

The interesting part is that they all mostly expose the standard Java collections API (lists, sets, maps...).

EDIT: All these projects let me open a file as a data store for huge collections, reference each collection by name, and keep many collections per file. Each collection is indexed. The idea is that these projects are meant to serve as a foundation for real databases; you can view them as the storage engine of a database (be it SQL or NoSQL). Because these projects are the foundation for projects like MongoDB, H2 Database and OrientDB, I am confident that if the simple data-store approach fits my needs, it will also scale without any problems. And because my partitioning needs are very simple, I can also share the load across other nodes.
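
As a concrete illustration of that collections-style API (using MapDB here purely as an example of such an engine; a sketch, untested):

import java.util.NavigableSet;
import java.util.stream.Collectors;
import org.mapdb.DB;
import org.mapdb.DBMaker;
import org.mapdb.Serializer;
import org.mapdb.serializer.SerializerArrayTuple;

public class MapDbEngine {
    private final DB db = DBMaker.fileDB("blocks.db").fileMmapEnableIfSupported().make();

    // one named, indexed collection inside the file; each element is [block_id, id, record]
    private final NavigableSet<Object[]> records = db.treeSet("records")
            .serializer(new SerializerArrayTuple(Serializer.LONG, Serializer.LONG, Serializer.STRING))
            .createOrOpen();

    public void add(long blockId, long id, String record) {
        records.add(new Object[]{blockId, id, record});
    }

    public String getConcatenatedRecords(long blockId) {
        // subSet selects one block_id prefix; null acts as positive infinity in tuple comparisons
        return records.subSet(new Object[]{blockId}, new Object[]{blockId, null})
                .stream()
                .map(t -> (String) t[2])
                .collect(Collectors.joining("\n"));
    }
}

The point is that the range scan touches only the pages of one block, which is what keeps getConcatenatedRecords fast.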

OTHER TIPS

It sounds like you already have this running in Postgres - can you post the schema you're using? It's certainly possible to do better than a well-tuned database in very specific scenarios, but it usually turns out to be vastly more work than you imagine going in (especially if you're synchronizing writes).

Are you using CLUSTER with your index? What are the storage settings for the table?

And how large can the table get before your queries become too slow?
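
For illustration, the kind of schema and CLUSTER setup I have in mind looks like this (all names invented, run here via JDBC):

import java.sql.Connection;
import java.sql.DriverManager;
import java.sql.Statement;

public class PostgresSetup {
    public static void main(String[] args) throws Exception {
        try (Connection conn = DriverManager.getConnection(
                "jdbc:postgresql://localhost/mydb", "user", "password");
             Statement st = conn.createStatement()) {
            // schema as described in the question: id unique, block_id not
            st.execute("CREATE TABLE records ("
                    + " id       bigserial PRIMARY KEY,"
                    + " block_id bigint NOT NULL,"
                    + " record   text NOT NULL)");
            st.execute("CREATE INDEX records_block_idx ON records (block_id)");
            // physically reorder the table by block_id so one block sits in adjacent pages
            st.execute("CLUSTER records USING records_block_idx");
        }
    }
}

Keep in mind that CLUSTER is a one-shot operation: PostgreSQL does not maintain the ordering for new rows, so it has to be re-run as the data changes.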

Since you seem to be building an object store on top of PostgreSQL, why not use an object store instead?

I'd start with OpenStack Swift, or alternately a distributed network file system if that's closer to your needs. (Ab)using PostgreSQL as a network file system isn't going to get you far if you care about performance. The only time I'd do that would be when I needed ACID semantics, such as atomic commits of some database changes along with a file they relate to.

You don't get atomic commit across multiple PostgreSQL instances (though you get close with prepared transactions), so I'm guessing that's not your use case. If it isn't, I suggest looking for the right tool for the job.
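
To unpack the "prepared transactions" aside: that is PostgreSQL's two-phase commit, where a transaction is first made durable and then committed separately, possibly from a different session. A minimal sketch (connection details and names invented; the server needs max_prepared_transactions > 0):

import java.sql.Connection;
import java.sql.DriverManager;
import java.sql.Statement;

public class TwoPhaseCommitSketch {
    static final String URL = "jdbc:postgresql://localhost/mydb"; // invented

    public static void main(String[] args) throws Exception {
        // phase 1: do the work, then park the transaction on disk instead of committing
        try (Connection conn = DriverManager.getConnection(URL, "user", "password")) {
            conn.setAutoCommit(false);
            try (Statement st = conn.createStatement()) {
                st.execute("INSERT INTO records (block_id, record) VALUES (42, 'asdf')");
                st.execute("PREPARE TRANSACTION 'txn-42'");
            }
        }

        // phase 2: any session can later commit (or roll back) the parked transaction
        try (Connection conn = DriverManager.getConnection(URL, "user", "password");
             Statement st = conn.createStatement()) {
            st.execute("COMMIT PREPARED 'txn-42'");
        }
    }
}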

Licensed under: CC-BY-SA with attribution