This is a situation where data.table
might be a very good option. data.table
has consistently shown to be blisteringly fast, much more so that plyr
. There are many examples here on SO, see e.g.:
- how to operate with a subset of an R dataframe in long format?
- Using plyr, doMC, and summarise() with very big dataset?
- or a blogpost of mine.
This is just a very small portion of the information available, you can check out the documentation of data.table
, or look at the [r][data.table]
tags on SO.