Step 1: Create a sample Amazon Redshift cluster - Amazon Redshift
Services or capabilities described in Amazon Web Services documentation might vary by Region. To see the differences applicable to the China Regions, see Getting Started with Amazon Web Services in China (PDF).

Step 1: Create a sample Amazon Redshift cluster

In this tutorial, you walk through the process to create an Amazon Redshift cluster with a database. Then you load a dataset from Amazon S3 to tables your database. You can use this sample cluster to evaluate the Amazon Redshift service.

Before you begin setting up an Amazon Redshift cluster, make sure that you complete any necessary prerequisites such as Signing up for Amazon and Determine firewall rules.

For any operation that accesses data from another Amazon resource, your cluster needs permission to access the resource and the data on the resource on your behalf. An example is using a SQL COPY command to load data from Amazon Simple Storage Service (Amazon S3). You provide those permissions by using Amazon Identity and Access Management (IAM). You can do this through an IAM role that you create and attached to your cluster. For more information about credentials and access permissions, see Credentials and access permissions in the Amazon Redshift Database Developer Guide.

To create an Amazon Redshift cluster
  1. Sign in to the Amazon Web Services Management Console and open the Amazon Redshift console at https://console.amazonaws.cn/redshiftv2/.

    Important

    If you use IAM user credentials, ensure that you have the necessary permissions to perform the cluster operations. For more information, see Security in Amazon Redshift in the Amazon Redshift Management Guide.

  2. On the Amazon console, choose the Amazon Web Services Region where you want to create the cluster.

  3. On the navigation menu, choose Clusters, then choose Create cluster. The Create cluster page appears.

  4. In the Cluster configuration section, specify values for Cluster identifier, Node type, and Nodes:

    • Cluster identifier: Enter examplecluster for this tutorial. This identifier must be unique. The identifier must be from 1–63 characters using as valid characters a–z (lowercase only) and - (hyphen).

    • Choose one of the following methods to size your cluster:

      Note

      The following step assumes an Amazon Web Services Region that supports RA3 node types. For a list of Amazon Web Services Regions that support RA3 node types, see Overview of RA3 node types in the Amazon Redshift Management Guide. To learn more about the node specifications for each node type and size, see Node type details.

      • If you don't know how large to size your cluster, choose Help me choose. Doing so opens a sizing calculator that asks you questions about the size and query characteristics of the data that you plan to store in your data warehouse.

        If you know the required size of your cluster (that is, the node type and number of nodes), choose I'll choose. Then choose the Node type and number of Nodes to size your cluster.

        For this tutorial, choose ra3.4xlarge for Node type and 2 for Number of nodes.

        If a choice for AZ configuration is available, choose Single-AZ.

      • To use the sample dataset Amazon Redshift provides, in Sample data, choose Load sample data. Amazon Redshift loads the sample dataset Tickit to the default dev database and public schema.

  5. In the Database configuration section, specify a value for Admin user name. For Admin password, choose from the following options:

    • Generate a password – Use a password generated by Amazon Redshift.

    • Manually add an admin password – Use your own password.

    • Manage admin credentials in Amazon Secrets Manager – Amazon Redshift uses Amazon Secrets Manager to generate and manage your admin password. Using Amazon Secrets Manager to generate and manage your password's secret incurs a fee. For information on Amazon Secrets Manager pricing, see Amazon Secrets Manager Pricing.

    For this tutorial, use these values:

    • Admin user name: Enter awsuser.

    • Admin user password: Enter a value for the password.

  6. For this tutorial, create an IAM role and set it as the default for your cluster, as described following. There can only be one default IAM role set for a cluster.

    1. Under Cluster permissions, for Manage IAM roles, choose Create IAM role.

    2. Specify an Amazon S3 bucket for the IAM role to access by one of the following methods:

      • Choose No additional Amazon S3 bucket to allow the created IAM role to access only the Amazon S3 buckets that are named as redshift.

      • Choose Any Amazon S3 bucket to allow the created IAM role to access all Amazon S3 buckets.

      • Choose Specific Amazon S3 buckets to specify one or more Amazon S3 buckets for the created IAM role to access. Then choose one or more Amazon S3 buckets from the table.

    3. Choose Create IAM role as default. Amazon Redshift automatically creates and sets the IAM role as the default for your cluster.

      Because you created your IAM role from the console, it has the AmazonRedshiftAllCommandsFullAccess policy attached. This allows Amazon Redshift to copy, load, query, and analyze data from Amazon resources in your IAM account.

    For information on how to manage the default IAM role for a cluster, see Creating an IAM role as default for Amazon Redshift in the Amazon Redshift Management Guide.

  7. (Optional) In the Additional configurations section, turn off Use defaults to modify Network and security, Database configuration, Maintenance, Monitoring, and Backup settings.

    In some cases, you might create your cluster with the Load sample data option and want to turn on enhanced Amazon VPC routing. If so, the cluster in your virtual private cloud (VPC) requires access to the Amazon S3 endpoint for data to be loaded.

    To make the cluster publicly accessible, you can do one of two things. You can configure a network address translation (NAT) address in your VPC for the cluster to access the internet. Or you can configure an Amazon S3 VPC endpoint in your VPC. For more information about enhanced Amazon VPC routing, see Enhanced Amazon VPC routing in the Amazon Redshift Management Guide.

  8. Choose Create cluster. Wait for your cluster to be created with Available status on the Clusters page.