Amazon Glue Spark and PySpark jobs
Amazon Glue support Spark and PySpark jobs. A Spark job is run in an Apache Spark environment managed by Amazon Glue. It processes data in batches. A streaming ETL job is similar to a Spark job, except that it performs ETL on data streams. It uses the Apache Spark Structured Streaming framework. Some Spark job features are not available to streaming ETL jobs.
The following sections provide information on Amazon Glue Spark and PySpark jobs.