Question

Will regularly running nodetool repair on my Cassandra nodes cripple them?

The Planet Cassandra FAQ notes (emphasis added) that

Anti-Entropy Node Repair – For data that is not read frequently, or to update data on a node that has been down for an extended period, the node repair process (also referred to as anti-entropy repair) ensures that all data on a replica is made consistent. Node repair (using the nodetool utility) should be run routinely as part of regular cluster maintenance operations.

That is the only reference I've seen to running nodetool repair regularly. Running it regularly won't be a problem if it is cheap, but just how expensive is it? Does it do the equivalent of a consistency-checked read of every record on the node? Or is it more clever than that? The documentation mentions the use of Merkle trees, but that does not give me any idea how expensive the operation is.

If you have 500 GB of data on a node, and that node is actually consistent with the other replicas (so the repair is a no-op), about how much data does the repair read from disk (reading all 500 GB would take a couple of hours)? And about how much data is sent over the LAN (sending all 500 GB over the LAN could take another hour or so)?


Solution

Some use cases are more dependent on regular repairs than others. If you perform deletes at less than ConsistencyLevel.ALL then you should run repair to ensure deleted columns don't come back to life (tombstones are discarded after gc_grace_seconds, so repair must run within that window or a replica that missed the delete can resurrect the data). If you don't do deletes, you can rely on hinted handoff and read repair to maintain consistency in many cases. If you read and write at low consistency levels, or regularly have server downtime or overloading, you will probably want to run repair.

What repair does is read through all the data on the node you run it on (optionally, with the -pr (primary range) option, only the ranges for which the node owns the primary range) and build up a Merkle tree. It also sends a message to all nodes that store replicas of any of these ranges to do the same - they will only read through the data that is replicated on the initial repair node.

To build a Merkle tree on a node with 500 GB, repair will read through the full 500 GB (when using -pr, the amount read is lower by roughly a factor of the replication factor). However, the Merkle trees are constant size (a few MB), so very little is sent over the network if the nodes are in sync.
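The shape of that trade-off can be sketched in a few lines of Python. This is not Cassandra's actual tree or wire format (the function names and hashing scheme here are illustrative), but it shows the key point: each replica hashes all of its data locally, and only the small tree of hashes crosses the network; matching root hashes mean nothing needs to be streamed.

```python
# Minimal Merkle-tree sketch: local hashing is expensive (reads every row),
# but comparing replicas only exchanges hashes, not data.
import hashlib

def merkle_tree(rows):
    """Build a Merkle tree (list of levels, leaves first) over (key, value) rows."""
    level = [hashlib.sha256(f"{k}:{v}".encode()).digest() for k, v in rows]
    tree = [level]
    while len(level) > 1:
        # Pair up hashes; duplicate the last one if the level has odd length.
        level = [hashlib.sha256(level[i] + (level[i + 1] if i + 1 < len(level) else level[i])).digest()
                 for i in range(0, len(level), 2)]
        tree.append(level)
    return tree  # tree[-1][0] is the root hash

def out_of_sync(tree_a, tree_b):
    """Comparing root hashes is enough to detect a no-op repair."""
    return tree_a[-1][0] != tree_b[-1][0]

rows = [("k1", "v1"), ("k2", "v2"), ("k3", "v3"), ("k4", "v4")]
a = merkle_tree(rows)
b = merkle_tree(rows)                      # replica fully in sync
c = merkle_tree(rows[:3] + [("k4", "X")])  # replica with one stale column
print(out_of_sync(a, b))  # False: roots match, nothing is streamed
print(out_of_sync(a, c))  # True: trees differ, only divergent ranges stream
```

When trees do differ, the comparison can descend level by level to narrow the mismatch down to a subrange, which is why only the out-of-sync portion of the data is streamed rather than the whole 500 GB.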

The best way to run scheduled repairs is to run with -pr on each node in turn. This avoids repairing the same data multiple times. Also, only run on one node at once to avoid placing extra load on your cluster.
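One way to express that schedule is with staggered cron entries, one per node, so that no two repairs overlap. This is a sketch, not a recommendation for specific times: the day-of-week staggering and 02:00 start time are assumptions you should adapt to your own off-peak window and repair duration.

```
# crontab on node1 - primary-range repair, early Monday
0 2 * * 1  nodetool repair -pr
# crontab on node2 - Tuesday
0 2 * * 2  nodetool repair -pr
# crontab on node3 - Wednesday
0 2 * * 3  nodetool repair -pr
```

Leave enough slack between entries for the previous node's repair to finish; if a repair can run longer than a day, an external scheduler that serializes the runs is safer than fixed cron slots.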

Licensed under: CC-BY-SA with attribution