Services or capabilities described in Amazon Web Services documentation might vary by Region. To see the differences applicable to the China Regions,
see Getting Started with Amazon Web Services in China
(PDF).
Migrate data using CQLReplicator
With CQLReplicator, you can read data from Apache Cassandra in near real time through intelligently
scanning the Cassandra token ring using CQL queries. CQLReplicator doesn’t use Cassandra CDC and instead implements
a caching strategy to reduce the performance penalties of full scans. To reduce the number of writes to the destination,
CQLReplicator automatically removes duplicate replication events. With CQLReplicator, you can tune the replication of
changes from the source database to the destination database, allowing for a near real time migration of data from
Apache Cassandra to Amazon Keyspaces.
The following diagram shows the typical architecture of a CQLReplicator job using Amazon Glue.
To allow access
to Apache Cassandra running in a private VPC, configure an Amazon Glue connection with the connection type
Network.
To remove duplicates and enable key caching with the CQLReplicator job, configure
Amazon Simple Storage Service (Amazon S3).
The CQLReplicator job streams verified source database changes directly to Amazon Keyspaces.