Using Apache Iceberg framework in Amazon Glue Studio

Using Apache Iceberg framework in data targets

From the Target menu, choose Amazon Glue Studio Data Catalog.
In the Data source properties tab, choose a database and table.
Amazon Glue Studio displays the format type as Apache Iceberg and the Amazon S3 URL.

Enter values or select from the available options to configure Apache Iceberg format.

Format – choose Apache Iceberg from the drop-down menu.
Amazon S3 Target Location – choose the Amazon S3 target location by clicking Browse S3.
Data Catalog update options – Create a table in the Data Catalog and on subsequent runs, keep existing schema and add new partitions must be selected to proceed. Writing a new Iceberg table using Amazon Glue requires the Data Catalog to be configured as the catalog for the Iceberg table. To update an existing Iceberg table that has been registered in the Data Catalog, choose Data Catalog as the target.
- Database – Choose the database from the Data Catalog.
- Table Name – Enter the value for your table name. Apache Iceberg table names must be in all lower case. Use underscores if needed since spaces are not allowed. For example "data_lake_format_tables".

From the Source menu, choose Amazon Glue Studio Data Catalog.
In the Data source properties tab, choose a database and table.
Amazon Glue Studio displays the format type as Apache Iceberg and the Amazon S3 URL.

Apache Iceberg is not available as a data option for Amazon S3 source nodes in Amazon Glue Studio.

Javascript is disabled or is unavailable in your browser.

To use the Amazon Web Services Documentation, Javascript must be enabled. Please refer to your browser's Help pages for instructions.

Using Delta Lake framework in Amazon Glue Studio

Connecting to data sources