Document history for Amazon Glue Studio User Guide
The following table describes the important changes in each revision of the Amazon Glue Studio User Guide. For notification about updates to this documentation, you can subscribe to an RSS feed.
Change | Description | Date |
---|---|---|
Amazon Glue Studio now offers data quality | Amazon Glue Data Quality evaluates and monitors the quality of your data based on rules that you define. For more information, see Evaluating data quality with Amazon Glue Studio. | November 30, 2022 |
Amazon Glue Studio now offers custom visual transforms | Custom visual transforms let customers define, reuse, and share business-specific ETL logic among their teams. For more information, see Custom visual transforms . | November 28, 2022 |
Glue Studio is now available in China | Amazon Glue Studio is now available in the China Beijing and Ningxia regions. | October 11, 2021 |
Amazon Glue Studio offers notebook authoring, for interactive job editing | Notebooks help you to write and execute code, visualize the results, and share insights. Typically, data scientists use notebooks for experiments and data exploration tasks. For more information, see Using Notebooks. | October 1, 2021 |
Direct access to streaming sources now available | When adding data sources to your ETL job in the visual editor, you can supply information to access the data stream instead of having to use a Data Catalog database and table. | September 30, 2021 |
Custom connectors can now be used with data previews | When editing data source node using a custom connector, you can preview the dataset by choosing the Dat preview tab. For more information, see Custom Connectors . | September 24, 2021 |
Amazon Glue Studio supports Amazon Glue version 3.0 | When creating jobs in Amazon Glue Studio, you can choose Glue 3.0 as the version for your job in the Job details tab. If you do not choose a version for your ETL job, Glue 2.0 is used by default. | August 18, 2021 |
Amazon GovCloud (US) Region | Amazon Glue Studio is now available in the Amazon GovCloud (US) Region | August 18, 2021 |
Python shell authoring available in Amazon Glue Studio | When creating a new job, you can now choose to create a Python shell job. For more information, see Start the job creation process and Editing Python shell jobs in Amazon Glue Studio. | August 13, 2021 |
Upload scripts to Amazon Glue Studio | In conjunction with the script editor feature, you can upload job scripts to Amazon Glue Studio. For more information, see Start the job creation process and Editing or uploading a job script. | June 14, 2021 |
View your job's dataset while creating and editing jobs | You can use the new Data preview tab for a node in your job diagram to see a sample of the data processed by that node. For more information, see Using data previews in the visual job editor. | June 7, 2021 |
Specify settings for your streaming ETL job in the visual job editor | You can configure additional connection settings for streaming data sources in the visual job editor to optimize your streaming ETL jobs. For more information, see Using a streaming data source. | June 4, 2021 |
Network connection support added | If you want to access a data source located in your VPC, you can specify a network connection for the job. For more information, see Modify the job properties. | May 24, 2021 |
Edit job scripts | You can now edit scripts in the job editor. For more information, see Editing a job script. | May 24, 2021 |
Delete jobs using the Amazon Glue Studio console | You can now delete jobs in Amazon Glue Studio. To learn how, see Delete jobs. | May 24, 2021 |
Read data from files in child folders in Amazon S3 | You can specify a single folder in Amazon S3 as your data source and use the Recursive option to include all the child folders as part of the data source. For more information, see Using files in Amazon S3 for the data source. | April 30, 2021 |
Delete connectors and connections functionality added | You can now delete connectors and connections in Amazon Glue Studio. For more information, see Deleting connectors and connections. | April 30, 2021 |
Fill missing values transform added | You can use the FillMissingValues transform in Amazon Glue Studio to locate records in the dataset that have missing values and add a new field with an estimated value. For more information, see Editing the data transform node. | March 29, 2021 |
SQL transform available | You can use a SQL transform node to write your own transform in the form of a SQL query. For more information, see Using a SQL query to transform data. | March 23, 2021 |
JDBC source nodes now support job bookmark keys | Job bookmarks help Amazon Glue maintain state information and prevent the reprocessing of old data. For more information, see Authoring jobs with custom connectors. | March 15, 2021 |
Connectors can be used for data targets | Using a custom or Amazon Web Services Marketplace connector for your data target is now supported. For more information, see Authoring jobs with custom connectors. | March 15, 2021 |
A new toolbar is available for the visual job editor | A more streamlined and functional toolbar is available for the visual job editor of Amazon Glue Studio. This feature makes it easier to add nodes to your graph. | March 8, 2021 |
Read data from Amazon S3 without creating Data Catalog tables | Amazon Glue Studio now allows you to read data directly from Amazon S3 without first creating a table in the Amazon Glue Data Catalog. For more information, see Editing the data source node. | February 5, 2021 |
Amazon Glue Studio jobs can now update Data Catalog tables | Amazon Glue Studio now supports updating the Amazon Glue Data Catalog during job runs. This feature makes it easy to keep your tables up to date as your jobs write new data into Amazon S3. This makes the data immediately available for query from any analytics service that is compatible with the Amazon Glue Data Catalog. For more information, see Configuring data target nodes. | February 5, 2021 |
Job scheduling now available in Amazon Glue Studio | You can define a time-based schedule for your job runs in Amazon Glue Studio. You can use
the console to create a basic schedule, or define a more complex schedule using
the Unix-like cron | December 21, 2020 |
Amazon Glue Custom Connectors released | Amazon Glue Custom Connectors allow you to discover and subscribe to connectors in Amazon Web Services Marketplace. We also released Amazon Glue Spark runtime interfaces to plug in connectors built for Apache Spark Datasource, Athena federated query, and JDBC APIs. For more information, see Using Connectors and connections with Amazon Glue Studio. | December 21, 2020 |
Support for running streaming ETL jobs in Amazon Glue version 2.0 | Amazon Glue Studio now supports running streaming ETL jobs using Amazon Glue version 2.0. For more information, see Adding Streaming ETL Jobs in Amazon Glue in the Amazon Glue Developer Guide. | November 11, 2020 |
Availability of Amazon Glue Studio announced | Amazon Glue Studio provides a visual interface that simplifies the creation of jobs that prepare the data for analysis. The initial version of this guide was published on the same day Amazon Glue Studio launched. | September 23, 2020 |