Creating a cluster with HBase - Amazon EMR
Services or capabilities described in Amazon Web Services documentation might vary by Region. To see the differences applicable to the China Regions, see Getting Started with Amazon Web Services in China (PDF).

Creating a cluster with HBase

The procedures in this section cover the basics of launching a cluster using the Amazon Web Services Management Console and the Amazon CLI. For detailed information about how to plan, configure, and launch Amazon EMR clusters, see Plan and configure clusters in the Amazon EMR Management Guide.

Creating a cluster with HBase using the console

For quick steps to launch clusters with the console, see Getting started with Amazon EMR in the Amazon EMR Management Guide.

To launch a cluster with HBase installed using the console

  1. Open the Amazon EMR console at

  2. Choose Create cluster and Go to advanced options.

  3. For Software Configuration, choose an Amazon Release Version of 4.6.0 or later (we recommend the latest version). Choose HBase and other applications as desired.

  4. With Amazon EMR version 5.2.0 and later, under HBase Storage Settings, select HDFS or S3. For more information, see HBase on Amazon S3 (Amazon S3 storage mode).

  5. Select other options as necessary and then choose Create cluster.

Creating a cluster with HBase using the Amazon CLI

Use the following command to create a cluster with HBase installed:

aws emr create-cluster --name "Test cluster" --release-label emr-7.1.0 \ --applications Name=HBase --use-default-roles --ec2-attributes KeyName=myKey \ --instance-type m5.xlarge --instance-count 3

Linux line continuation characters (\) are included for readability. They can be removed or used in Linux commands. For Windows, remove them or replace with a caret (^).

If you use HBase on Amazon S3, specify the --configurations option with a reference to a JSON configuration object. The configuration object must contain an hbase-site classification that specifies the location in Amazon S3 where HBase data is stored using the hbase.rootdir property. It also must contain an hbase classification, which specifies s3 using the hbase.emr.storageMode property. The following example demonstrates a JSON snippet with these configuration settings.

[ { "Classification": "hbase-site", "Properties": { "hbase.rootdir": "s3://MyBucket/MyHBaseStore" } }, { "Classification": "hbase", "Properties": { "hbase.emr.storageMode": "s3" } } ]

For more information about HBase on Amazon S3, see HBase on Amazon S3 (Amazon S3 storage mode). For more information about classifications, see Configure applications.