Performing complex ETL activities using blueprints and workflows in Amazon Glue
Some of your organization's complex extract, transform, and load (ETL) processes might best be implemented by using multiple, dependent Amazon Glue jobs and crawlers. Using Amazon Glue workflows, you can design a complex multi-job, multi-crawler ETL process that Amazon Glue can run and track as single entity. After you create a workflow and specify the jobs, crawlers, and triggers in the workflow, you can run the workflow on demand or on a schedule.
Topics
- Overview of workflows in Amazon Glue
- Creating and building out a workflow manually in Amazon Glue
- Starting an Amazon Glue workflow with an Amazon EventBridge event
- Viewing the EventBridge events that started a workflow
- Running and monitoring a workflow in Amazon Glue
- Stopping a workflow run
- Repairing and resuming a workflow run
- Getting and setting workflow run properties in Amazon Glue
- Querying workflows using the Amazon Glue API
- Blueprint and workflow restrictions in Amazon Glue
- Troubleshooting blueprint errors in Amazon Glue
- Permissions for personas and roles for Amazon Glue blueprints