Services or capabilities described in Amazon Web Services documentation might vary by Region. To see the differences applicable to the China Regions, see Getting Started with Amazon Web Services in China.

Creating a workflow from a blueprint in Amazon Glue

You can create an Amazon Glue workflow manually, adding one component at a time, or you can create a workflow from an Amazon Glue blueprint. Amazon Glue includes blueprints for common use cases. Your Amazon Glue developers can create additional blueprints.

Important

Limit the total number of jobs, crawlers, and triggers within a workflow to 100 or fewer. If you include more than 100, you might get errors when trying to resume or stop workflow runs.
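As a rough way to check a workflow against this limit, the following Python sketch counts the jobs, crawlers, and triggers in a workflow graph. It assumes the graph is a dictionary shaped like the `Graph` element that the GetWorkflow API returns when the graph is included (a `Nodes` list whose entries carry a `Type` of `JOB`, `CRAWLER`, or `TRIGGER`). The function names are illustrative and not part of any Amazon Glue library.

```python
from collections import Counter

# Limit stated in the Important note above.
MAX_COMPONENTS = 100


def count_workflow_components(graph):
    """Count JOB, CRAWLER, and TRIGGER nodes in a workflow graph dictionary.

    Assumption: `graph` looks like {"Nodes": [{"Type": "JOB", ...}, ...]}.
    """
    counts = Counter(node.get("Type") for node in graph.get("Nodes", []))
    return {kind: counts.get(kind, 0) for kind in ("JOB", "CRAWLER", "TRIGGER")}


def within_component_limit(graph, limit=MAX_COMPONENTS):
    """Return True if the total number of components does not exceed the limit."""
    return sum(count_workflow_components(graph).values()) <= limit
```

With the Amazon SDK for Python (Boto3), the graph could plausibly be obtained from `glue.get_workflow(Name=..., IncludeGraph=True)["Workflow"]["Graph"]`; verify the response shape against the API reference before relying on it.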

When you use a blueprint, you can quickly generate a workflow for a specific use case based on the generalized use case defined by the blueprint. You define the specific use case by providing values for the blueprint parameters. For example, a blueprint that partitions a dataset could have the Amazon S3 source and target paths as parameters.
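To make the partitioning example concrete, the following Python sketch assembles parameter values for such a blueprint and serializes them to a JSON string. The parameter names `WorkflowName`, `SourceS3Path`, and `TargetS3Path` are hypothetical; the actual names are defined by each blueprint's developer.

```python
import json


def build_blueprint_parameters(workflow_name, source_path, target_path):
    """Return a JSON string of parameter values for a partitioning blueprint.

    The parameter names used here are hypothetical examples only.
    """
    for label, path in (("source", source_path), ("target", target_path)):
        if not path.startswith("s3://"):
            raise ValueError(f"The {label} path must be an Amazon S3 URI: {path!r}")
    return json.dumps(
        {
            "WorkflowName": workflow_name,
            "SourceS3Path": source_path,
            "TargetS3Path": target_path,
        }
    )
```

Validating the paths before submitting the values is a design choice of this sketch, intended to catch simple mistakes before a blueprint run starts.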

Amazon Glue creates a workflow from a blueprint by running the blueprint. The blueprint run saves the parameter values that you supplied, and is used to track the progress and outcome of the creation of the workflow and its components. When troubleshooting a workflow, you can view the blueprint run to determine the blueprint parameter values that were used to create a workflow.
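When troubleshooting, the saved parameter values can also be read programmatically. The following Python sketch assumes a Boto3 Glue client (for example, `boto3.client("glue")`) is passed in as `glue`, and that the `BlueprintRun` structure returned by `get_blueprint_run` carries the parameters as a JSON string in its `Parameters` field. The helper name is illustrative.

```python
import json


def blueprint_run_parameters(glue, blueprint_name, run_id):
    """Return (workflow_name, parameters) recorded for a blueprint run.

    `glue` is expected to behave like a Boto3 Glue client. The Parameters
    field is assumed to hold a JSON-encoded string.
    """
    response = glue.get_blueprint_run(BlueprintName=blueprint_name, RunId=run_id)
    run = response["BlueprintRun"]
    parameters = json.loads(run.get("Parameters") or "{}")
    return run.get("WorkflowName"), parameters
```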

To create and view workflows, you require certain IAM permissions. For a suggested IAM policy, see Data analyst permissions for blueprints.

You can create a workflow from a blueprint by using the Amazon Glue console, Amazon Glue API, or Amazon Command Line Interface (Amazon CLI).
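As a minimal sketch of the API route, the following Python function starts a blueprint run through the Amazon SDK for Python (Boto3) `start_blueprint_run` call. It assumes that a Glue client is passed in as `glue`, that the blueprint is already registered, and that the parameters are supplied as a dictionary and sent as a JSON string. The function name and the example values in the comment are placeholders.

```python
import json


def create_workflow_from_blueprint(glue, blueprint_name, parameters, role_arn):
    """Start a blueprint run and return its run ID.

    `glue` is expected to behave like a Boto3 Glue client.
    `parameters` is a dict of blueprint parameter values; it is serialized
    to JSON here on the assumption that the API expects a JSON string.
    """
    response = glue.start_blueprint_run(
        BlueprintName=blueprint_name,
        Parameters=json.dumps(parameters),
        RoleArn=role_arn,
    )
    return response["RunId"]


# Example call (placeholder values; requires boto3 and valid credentials):
#   import boto3
#   run_id = create_workflow_from_blueprint(
#       boto3.client("glue"),
#       "my-blueprint",
#       {"WorkflowName": "my-workflow"},
#       "arn:aws-cn:iam::123456789012:role/MyBlueprintRole",
#   )
```

Passing the client in as an argument keeps the function easy to exercise with a stub, without contacting the service.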

To create a workflow from a blueprint (console)

  1. Open the Amazon Glue console at https://console.amazonaws.cn/glue/.

    Sign in as a user that has permissions to create a workflow.

  2. In the navigation pane, choose Blueprints.

  3. Select a blueprint, and on the Actions menu, choose Create workflow.

  4. On the Create a workflow from <blueprint-name> page, enter the following information:

    Blueprint parameters

    These vary depending on the blueprint design. For questions about the parameters, ask the developer of the blueprint. Blueprints typically include a parameter for the workflow name.

    IAM role

    The role that Amazon Glue assumes to create the workflow and its components. The role must have permissions to create and delete workflows, jobs, crawlers, and triggers. For a suggested policy for the role, see Permissions for blueprint roles.

  5. Choose Submit.

    The Blueprint Details page appears, showing a list of blueprint runs at the bottom.

  6. In the blueprint runs list, check the topmost blueprint run for workflow creation status.

    The initial status is RUNNING. Choose the refresh button until the status changes to SUCCEEDED or FAILED.

  7. Do one of the following:

    • If the completion status is SUCCEEDED, you can go to the Workflows page, select the newly created workflow, and run it. Before running the workflow, you can review the design graph.

    • If the completion status is FAILED, select the blueprint run, and on the Actions menu, choose View to see the error message.
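Steps 6 and 7 can be approximated outside the console by polling the blueprint run. The following Python sketch assumes a Boto3 Glue client is passed in as `glue`, and that the `BlueprintRun` structure returned by `get_blueprint_run` exposes `State` and, on failure, `ErrorMessage`. The function name, polling interval, and poll limit are arbitrary choices for illustration.

```python
import time


def wait_for_blueprint_run(glue, blueprint_name, run_id,
                           poll_seconds=5.0, max_polls=120, sleep=time.sleep):
    """Poll a blueprint run until it reaches SUCCEEDED or FAILED.

    Returns the BlueprintRun dict on success, raises RuntimeError with the
    reported error message on failure, and raises TimeoutError if the run
    is still in progress after `max_polls` checks.
    """
    for _ in range(max_polls):
        run = glue.get_blueprint_run(
            BlueprintName=blueprint_name, RunId=run_id
        )["BlueprintRun"]
        state = run.get("State")
        if state == "SUCCEEDED":
            return run
        if state == "FAILED":
            message = run.get("ErrorMessage", "no error message reported")
            raise RuntimeError(f"Blueprint run {run_id} failed: {message}")
        # Any other state (for example RUNNING) is treated as still in progress.
        sleep(poll_seconds)
    raise TimeoutError(f"Blueprint run {run_id} did not finish after {max_polls} polls")
```

On success, the returned dictionary is expected to include the name of the new workflow in `WorkflowName`, which can then be reviewed and run from the Workflows page.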

For more information on workflows and blueprints, see the following topics.