Migrate data using CQLReplicator - Amazon Keyspaces (for Apache Cassandra)
Services or capabilities described in Amazon Web Services documentation might vary by Region. To see the differences applicable to the China Regions, see Getting Started with Amazon Web Services in China (PDF).

Migrate data using CQLReplicator

With CQLReplicator, you can read data from Apache Cassandra in near real time through intelligently scanning the Cassandra token ring using CQL queries. CQLReplicator doesn’t use Cassandra CDC and instead implements a caching strategy to reduce the performance penalties of full scans. To reduce the number of writes to the destination, CQLReplicator automatically removes duplicate replication events. With CQLReplicator, you can tune the replication of changes from the source database to the destination database, allowing for a near real time migration of data from Apache Cassandra to Amazon Keyspaces.

The following diagram shows the typical architecture of a CQLReplicator job using Amazon Glue.

  1. To allow access to Apache Cassandra running in a private VPC, configure an Amazon Glue connection with the connection type Network.

  2. To remove duplicates and enable key caching with the CQLReplicator job, configure Amazon Simple Storage Service (Amazon S3).

  3. The CQLReplicator job streams verified source database changes directly to Amazon Keyspaces.

Using CQLReplicator to migrate data from Apache Cassandra to Amazon Keyspaces.