Data modeling best practices: recommendations for designing data models - Amazon Keyspaces (for Apache Cassandra)
Services or capabilities described in Amazon Web Services documentation might vary by Region. To see the differences applicable to the China Regions, see Getting Started with Amazon Web Services in China (PDF).

Data modeling best practices: recommendations for designing data models

Effective data modeling is crucial for optimizing performance and minimizing costs when working with Amazon Keyspaces (for Apache Cassandra). This topic covers key considerations and recommendations for designing data models that suit your application's data access patterns.

  • Partition Key Design – The partition key plays a critical role in determining how data is distributed across partitions in Amazon Keyspaces. Choosing an appropriate partition key can significantly impact query performance and throughput costs. This section discusses strategies for designing partition keys that promote even distribution of read and write activity across partitions.

  • Key Considerations:

    • Uniform activity distribution – Aim for uniform read and write activity across all partitions to minimize throughput costs and leverage burst capacity effectively.

    • Access patterns – Align your partition key design with your application's primary data access patterns.

    • Partition size – Avoid creating partitions that grow too large, as this can impact performance and increase costs.