Question

I am uploading data from .csv files into Cassandra cluster deployed in Amazon EC2 using COPY command from EBS volume attached to it. I noticed that the Cassandra upload time increases very badly, with increase in size of the .csv file.

Is there a way in which I can tune the settings to increase the load rate for cassandra.. ?

Was it helpful?

Solution

Use a real bulk loader. COPY is not appropriate for millions of records.

Licensed under: CC-BY-SA with attribution
Not affiliated with StackOverflow
scroll top