Setting up to use Python with Amazon Glue - Amazon Glue
Services or capabilities described in Amazon Web Services documentation might vary by Region. To see the differences applicable to the China Regions, see Getting Started with Amazon Web Services in China (PDF).

Setting up to use Python with Amazon Glue

Use Python to develop your ETL scripts for Spark jobs. The supported Python versions for ETL jobs depend on the Amazon Glue version of the job. For more information on Amazon Glue versions, see the Glue version job property.

To set up your system for using Python with Amazon Glue

Follow these steps to install Python and to be able to invoke the Amazon Glue APIs.

  1. If you don't already have Python installed, download and install it from the Python.org download page.

  2. Install the Amazon Command Line Interface (Amazon CLI) as documented in the Amazon CLI documentation.

    The Amazon CLI is not directly necessary for using Python. However, installing and configuring it is a convenient way to set up Amazon with your account credentials and verify that they work.

  3. Install the Amazon SDK for Python (Boto 3), as documented in the Boto3 Quickstart .

    Boto 3 resource APIs are not yet available for Amazon Glue. Currently, only the Boto 3 client APIs can be used.

    For more information about Boto 3, see Amazon SDK for Python (Boto3) Getting Started.

You can find Python code examples and utilities for Amazon Glue in the Amazon Glue samples repository on the GitHub website.