
Programming Spark scripts

Amazon Glue lets you write or autogenerate extract, transform, and load (ETL) scripts, and then test and run them. This section describes the extensions to Apache Spark that Amazon Glue has introduced, and provides examples of how to code and run ETL scripts in Python and Scala.


Each Amazon Glue version supports a specific version of Apache Spark. A custom script must be compatible with the Apache Spark version supported by the Amazon Glue version that the job runs on. For information about Amazon Glue versions, see the Glue version job property.
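
One way to make the compatibility requirement explicit is to have a script check the Spark version at startup and fail early on a mismatch. The following is a minimal sketch in plain Python, not part of the Amazon Glue API: the helper names `parse_major_minor` and `check_spark_version` and the value of `EXPECTED_SPARK_VERSION` are hypothetical and chosen only for illustration. It assumes that comparing the major and minor version numbers is a sufficient compatibility test for your script.

```python
import re
from typing import Tuple

# Hypothetical placeholder: set this to the Spark version your script was
# written against (look it up for the Glue version your job uses).
EXPECTED_SPARK_VERSION = "3.3"


def parse_major_minor(version: str) -> Tuple[int, int]:
    """Return (major, minor) from a version string such as '3.3.0' or '3.3.0-amzn-1'."""
    match = re.match(r"(\d+)\.(\d+)", version)
    if match is None:
        raise ValueError(f"Unrecognized Spark version string: {version!r}")
    return int(match.group(1)), int(match.group(2))


def check_spark_version(running: str, expected: str = EXPECTED_SPARK_VERSION) -> None:
    """Raise RuntimeError if the running major.minor version differs from the expected one."""
    if parse_major_minor(running) != parse_major_minor(expected):
        raise RuntimeError(
            f"Script expects Spark {expected}, but the job is running Spark {running}"
        )


# In a job script, the running version would typically come from the Spark
# session, for example:
#     check_spark_version(spark.version)
# where `spark` is the SparkSession available to the script.
```

Comparing only the major and minor numbers is a deliberate simplification; a script that depends on a specific patch release would need a stricter check.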