Whats the best practice when using Amazon Redshift as a cache

https://dba.stackexchange.com/questions/80875

redshift

10-12-2020
|

Question

I have tested Amazon Redshift as a cache. So I save data by a key. But with the time the cache becomes slower and slower. Once I do a VACCUM on the database it get backs to the speed. But should i run VACCUM continuously in the database ?. Or is there a better way of using redshift tables for continuous reads, writes and deletes.

Solution

I would not recommend using redshift as a cache. Have you considered Redis?

Redshift saves all data to disk in 1MB blocks, in an order established by your sortkey, and distributed between nodes based on your distkey. When ever you create, update, delete you are appending data to the last blocks of the database. These blocks that hold all the new changes are not sorted until you vaccume the database.

Hence, when you use redshift as a cache you are very quickly fragmenting your data, making your queries slow.

OTHER TIPS

Certainly a BIG NO for using redshift as cache.

Unless, by cache you meant storing pre-computed aggregates.

Licensed under: CC-BY-SA with attribution

Not affiliated with dba.stackexchange