Document history for Amazon Glue Studio User Guide - Amazon Glue Studio
Services or capabilities described in Amazon Web Services documentation might vary by Region. To see the differences applicable to the China Regions, see Getting Started with Amazon Web Services in China (PDF).

Document history for Amazon Glue Studio User Guide

The following table describes the important changes in each revision of the Amazon Glue Studio User Guide. For notification about updates to this documentation, you can subscribe to an RSS feed.

ChangeDescriptionDate

Amazon Glue Studio now offers data quality

Amazon Glue Data Quality evaluates and monitors the quality of your data based on rules that you define. For more information, see Evaluating data quality with Amazon Glue Studio.

November 30, 2022

Amazon Glue Studio now offers custom visual transforms

Custom visual transforms let customers define, reuse, and share business-specific ETL logic among their teams. For more information, see Custom visual transforms .

November 28, 2022

Glue Studio is now available in China

Amazon Glue Studio is now available in the China Beijing and Ningxia regions.

October 11, 2021

Amazon Glue Studio offers notebook authoring, for interactive job editing

Notebooks help you to write and execute code, visualize the results, and share insights. Typically, data scientists use notebooks for experiments and data exploration tasks. For more information, see Using Notebooks.

October 1, 2021

Direct access to streaming sources now available

When adding data sources to your ETL job in the visual editor, you can supply information to access the data stream instead of having to use a Data Catalog database and table.

September 30, 2021

Custom connectors can now be used with data previews

When editing data source node using a custom connector, you can preview the dataset by choosing the Dat preview tab. For more information, see Custom Connectors .

September 24, 2021

Amazon Glue Studio supports Amazon Glue version 3.0

When creating jobs in Amazon Glue Studio, you can choose Glue 3.0 as the version for your job in the Job details tab. If you do not choose a version for your ETL job, Glue 2.0 is used by default.

August 18, 2021

Amazon GovCloud (US) Region

Amazon Glue Studio is now available in the Amazon GovCloud (US) Region

August 18, 2021

Python shell authoring available in Amazon Glue Studio

When creating a new job, you can now choose to create a Python shell job. For more information, see Start the job creation process and Editing Python shell jobs in Amazon Glue Studio.

August 13, 2021

Upload scripts to Amazon Glue Studio

In conjunction with the script editor feature, you can upload job scripts to Amazon Glue Studio. For more information, see Start the job creation process and Editing or uploading a job script.

June 14, 2021

View your job's dataset while creating and editing jobs

You can use the new Data preview tab for a node in your job diagram to see a sample of the data processed by that node. For more information, see Using data previews in the visual job editor.

June 7, 2021

Specify settings for your streaming ETL job in the visual job editor

You can configure additional connection settings for streaming data sources in the visual job editor to optimize your streaming ETL jobs. For more information, see Using a streaming data source.

June 4, 2021

Network connection support added

If you want to access a data source located in your VPC, you can specify a network connection for the job. For more information, see Modify the job properties.

May 24, 2021

Edit job scripts

You can now edit scripts in the job editor. For more information, see Editing a job script.

May 24, 2021

Delete jobs using the Amazon Glue Studio console

You can now delete jobs in Amazon Glue Studio. To learn how, see Delete jobs.

May 24, 2021

Read data from files in child folders in Amazon S3

You can specify a single folder in Amazon S3 as your data source and use the Recursive option to include all the child folders as part of the data source. For more information, see Using files in Amazon S3 for the data source.

April 30, 2021

Delete connectors and connections functionality added

You can now delete connectors and connections in Amazon Glue Studio. For more information, see Deleting connectors and connections.

April 30, 2021

Fill missing values transform added

You can use the FillMissingValues transform in Amazon Glue Studio to locate records in the dataset that have missing values and add a new field with an estimated value. For more information, see Editing the data transform node.

March 29, 2021

SQL transform available

You can use a SQL transform node to write your own transform in the form of a SQL query. For more information, see Using a SQL query to transform data.

March 23, 2021

JDBC source nodes now support job bookmark keys

Job bookmarks help Amazon Glue maintain state information and prevent the reprocessing of old data. For more information, see Authoring jobs with custom connectors.

March 15, 2021

Connectors can be used for data targets

Using a custom or Amazon Web Services Marketplace connector for your data target is now supported. For more information, see Authoring jobs with custom connectors.

March 15, 2021

A new toolbar is available for the visual job editor

A more streamlined and functional toolbar is available for the visual job editor of Amazon Glue Studio. This feature makes it easier to add nodes to your graph.

March 8, 2021

Read data from Amazon S3 without creating Data Catalog tables

Amazon Glue Studio now allows you to read data directly from Amazon S3 without first creating a table in the Amazon Glue Data Catalog. For more information, see Editing the data source node.

February 5, 2021

Amazon Glue Studio jobs can now update Data Catalog tables

Amazon Glue Studio now supports updating the Amazon Glue Data Catalog during job runs. This feature makes it easy to keep your tables up to date as your jobs write new data into Amazon S3. This makes the data immediately available for query from any analytics service that is compatible with the Amazon Glue Data Catalog. For more information, see Configuring data target nodes.

February 5, 2021

Job scheduling now available in Amazon Glue Studio

You can define a time-based schedule for your job runs in Amazon Glue Studio. You can use the console to create a basic schedule, or define a more complex schedule using the Unix-like cron syntax. For more information, see Schedule job runs.

December 21, 2020

Amazon Glue Custom Connectors released

Amazon Glue Custom Connectors allow you to discover and subscribe to connectors in Amazon Web Services Marketplace. We also released Amazon Glue Spark runtime interfaces to plug in connectors built for Apache Spark Datasource, Athena federated query, and JDBC APIs. For more information, see Using Connectors and connections with Amazon Glue Studio.

December 21, 2020

Support for running streaming ETL jobs in Amazon Glue version 2.0

Amazon Glue Studio now supports running streaming ETL jobs using Amazon Glue version 2.0. For more information, see Adding Streaming ETL Jobs in Amazon Glue in the Amazon Glue Developer Guide.

November 11, 2020

Availability of Amazon Glue Studio announced

Amazon Glue Studio provides a visual interface that simplifies the creation of jobs that prepare the data for analysis. The initial version of this guide was published on the same day Amazon Glue Studio launched.

September 23, 2020