문제

We have hosted vertica on AWS and the total size of the db is 500GB. So what is the fastest method to migrate the data from vertica to redshift ?

one approach would be
1. export the vertica data to S3
2. load the data from S3 to redshift.

Is there any better approach ?

도움이 되었습니까?

해결책

There are three ways to get data into redshift:

  1. Load it from s3
  2. Load it from DynamoDB
  3. Do an insert query

Insert queries, even bulk ones, are bounded by the performance of the leader node of your redshift cluster. Options #1 and #2 are better since data insertion is done in parallel to/by the compute nodes of the cluster.

update: there is a 4th option now, load through ssh: http://docs.aws.amazon.com/redshift/latest/dg/loading-data-from-remote-hosts.html

라이센스 : CC-BY-SA ~와 함께 속성
제휴하지 않습니다 StackOverflow
scroll top