Step 1: Create the source CSV file and a target table for the data upload using DSBulk
For this tutorial, we use a comma-separated values (CSV) file with the
name keyspaces_sample_table.csv
as the source file for the
data migration. The provided sample file contains a few rows of data for a table with the name book_awards
.
Create the source file. You can choose one of the following options:
-
Download the sample CSV file (
keyspaces_sample_table.csv
) contained in the following archive file samplemigration.zip. Unzip the archive and take note of the path tokeyspaces_sample_table.csv
. -
To populate a CSV file with your own data stored in an Apache Cassandra database, you can populate the source CSV file by using
dsbulk unload
as shown in the following example.dsbulk unload -k
mykeyspace
-tmytable
-f ./my_application.conf >keyspaces_sample_table.csv
Make sure the CSV file you create meets the following requirements:
-
The first row contains the column names.
-
The column names in the source CSV file match the column names in the target table.
-
The data is delimited with a comma.
-
All data values are valid Amazon Keyspaces data types. See Data types.
-
-
-
Create the target keyspace and table in Amazon Keyspaces.
Connect to Amazon Keyspaces using
cqlsh
, replacing the service endpoint, user name, and password in the following example with your own values.cqlsh
cassandra.us-east-2.amazonaws.com
9142 -u"111122223333"
-p"wJalrXUtnFEMI/K7MDENG/bPxRfiCYEXAMPLEKEY"
--sslCreate a new keyspace with the name
catalog
as shown in the following example.CREATE KEYSPACE
catalog
WITH REPLICATION = {'class': 'SingleRegionStrategy'};After the new keyspace has a status of available, use the following code to create the target table
book_awards
. To learn more about asynchronous resource creation and how to check if a resource is available, see Check keyspace creation status in Amazon Keyspaces.CREATE TABLE
catalog.book_awards
( year int, award text, rank int, category text, book_title text, author text, publisher text, PRIMARY KEY ((year, award), category, rank) );
If Apache Cassandra is your original data source, a simple way to create the Amazon Keyspaces target table with matching headers is to generate the
CREATE TABLE
statement from the source table as shown in the following statement.cqlsh localhost 9042 -u "username" -p "password" --execute "DESCRIBE TABLE
mykeyspace.mytable
;"Then create the target table in Amazon Keyspaces with the column names and data types matching the description from the Cassandra source table.