Using a hybrid migration solution: Apache Cassandra to Amazon Keyspaces - Amazon Keyspaces (for Apache Cassandra)
Services or capabilities described in Amazon Web Services documentation might vary by Region. To see the differences applicable to the China Regions, see Getting Started with Amazon Web Services in China (PDF).

Using a hybrid migration solution: Apache Cassandra to Amazon Keyspaces

The following migration solution can be considered a hybrid between online and offline migration. With this hybrid approach, data is written to the destination database in near real time without providing read after write consistency. This means that newly written data won’t be immediately available and delays are to be expected. If you need read after write consistency, see Online migration to Amazon Keyspaces: strategies and best practices.

For a near real time migration from Apache Cassandra to Amazon Keyspaces, you can choose between two available methods.

  • CQLReplicator – CQLReplicator is an open source utility available on Github that helps you to migrate data from Apache Cassandra to Amazon Keyspaces in near real time. To determine the writes and updates to propagate to the destination database, CQLReplicator scans the Apache Cassandra token range and uses an Amazon Glue job to remove duplicate events and apply writes and updates directly to Amazon Keyspaces.

  • Change data capture (CDC) – Apache Cassandra offers a built-in CDC feature that allows capturing changes by copying the commit log to a separate CDC directory. You can then use these logs to replicate data changes to other systems such as Amazon Keyspaces, making CDC an effective option for data migration scenarios.

If you don't need read after write consistency, you can use either the CQLReplicator or a CDC pipeline to migrate data from Apache Cassandra to Amazon Keyspaces based on your preferences and familiarity with the tools and Amazon Web Services used in each solution. Using these methods to migrate data in near real time can be considered a hybrid approach to migration that offers an alternative to online migration. You still need to implement the other key components of online migration, for example historical data copy and the application migration strategies discussed in the online migration topic.

The following sections go over the hybrid migration options in more detail.