
Configuring a crawler

A crawler accesses your data store, extracts metadata, and creates table definitions in the Amazon Glue Data Catalog. The Crawlers pane in the Amazon Glue console lists all the crawlers that you create. The list displays status and metrics from the last run of your crawler.

Note

If you choose to bring in your own JDBC driver versions, Amazon Glue crawlers will consume resources in Amazon Glue jobs and Amazon S3 buckets to ensure that your provided drivers are run in your environment. The additional usage of resources will be reflected in your account. Additionally, providing your own JDBC driver does not mean that the crawler is able to use all of the driver's features. Drivers are limited to the properties described in Adding an Amazon Glue connection.

To configure a crawler
  1. Sign in to the Amazon Web Services Management Console and open the Amazon Glue console at https://console.amazonaws.cn/glue/. Choose Crawlers in the navigation pane.

  2. Choose Create crawler, and follow the instructions in the Add crawler wizard. The wizard guides you through the steps required to create a crawler. If you want to add custom classifiers to define the schema, see Adding classifiers to a crawler in Amazon Glue.
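
The console wizard maps to the CreateCrawler API, so you can also create a crawler programmatically. The following is a minimal sketch using the Amazon SDK for Python (Boto3); the crawler name, role, database, and Amazon S3 path are placeholders rather than values from this guide.

import boto3

glue = boto3.client("glue")

# Minimal crawler: one Amazon S3 include path, tables written to an
# existing Data Catalog database. All names are illustrative only.
glue.create_crawler(
    Name="my-s3-crawler",
    Role="arn:aws-cn:iam::123456789012:role/AWSGlueServiceRole-example",
    DatabaseName="example_db",
    Description="Crawls s3://example-bucket/data/",
    Targets={
        "S3Targets": [
            {"Path": "s3://example-bucket/data/"}
        ]
    },
    Tags={"team": "analytics"},  # tag keys are read-only after creation
)

# Run the crawler on demand.
glue.start_crawler(Name="my-s3-crawler")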

Step 1: Set crawler properties

Enter a name for your crawler and, optionally, a description. You can also tag your crawler with a Tag key and an optional Tag value. Once created, tag keys are read-only. Use tags on your resources to help you organize and identify them. For more information, see Amazon tags in Amazon Glue.

Name

Name may contain letters (A-Z), numbers (0-9), hyphens (-), or underscores (_), and can be up to 255 characters long.

Description

Descriptions can be up to 2048 characters long.

Tags

Use tags to organize and identify your resources. For more information, see Amazon tags in Amazon Glue.

Step 2: Choose data sources and classifiers

Data source configuration

For Is your data already mapped to Amazon Glue tables?, choose 'Not yet' or 'Yes'. By default, 'Not yet' is selected.

The crawler can access data stores directly as the source of the crawl, or it can use existing tables in the Data Catalog as the source. If the crawler uses existing catalog tables, it crawls the data stores that are specified by those catalog tables.

  • Not yet: Select one or more data sources to be crawled. A crawler can crawl multiple data stores of different types (Amazon S3, JDBC, and so on).

    You can configure only one data store at a time. After you have provided the connection information and include paths and exclude patterns, you then have the option of adding another data store.

  • Yes: Select existing tables from your Amazon Glue Data Catalog. The catalog tables specify the data stores to crawl. The crawler can crawl only catalog tables in a single run; it can't mix in other source types.

    A common reason to specify a catalog table as the source is when you create the table manually (because you already know the structure of the data store) and you want a crawler to keep the table updated, including adding new partitions. For a discussion of other reasons, see Updating manually created Data Catalog tables using crawlers.

    When you specify existing tables as the crawler source type, the following conditions apply:

    • Database name is optional.

    • Only catalog tables that specify Amazon S3 or Amazon DynamoDB data stores are permitted.

    • No new catalog tables are created when the crawler runs. Existing tables are updated as needed, including adding new partitions.

    • Deleted objects found in the data stores are ignored; no catalog tables are deleted. Instead, the crawler writes a log message. (SchemaChangePolicy.DeleteBehavior=LOG)

    • The crawler configuration option to create a single schema for each Amazon S3 path is enabled by default and cannot be disabled. (TableGroupingPolicy=CombineCompatibleSchemas) For more information, see How to create a single schema for each Amazon S3 include path.

    • You can't mix catalog tables as a source with any other source types (for example Amazon S3 or Amazon DynamoDB).
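
These conditions map to the CatalogTargets field of the CreateCrawler API. The following is a sketch using the Amazon SDK for Python (Boto3); it assumes a database and tables that already exist in your Data Catalog, and all names are placeholders.

import boto3

glue = boto3.client("glue")

# Crawl existing Data Catalog tables (the 'Yes' option). Illustrative only.
glue.create_crawler(
    Name="catalog-source-crawler",
    Role="arn:aws-cn:iam::123456789012:role/AWSGlueServiceRole-example",
    Targets={
        "CatalogTargets": [
            {
                "DatabaseName": "example_db",          # existing database
                "Tables": ["sales", "sales_archive"],  # existing tables
            }
        ]
    },
    # Deleted objects are logged, not removed, when catalog tables are the source.
    SchemaChangePolicy={
        "UpdateBehavior": "UPDATE_IN_DATABASE",
        "DeleteBehavior": "LOG",
    },
)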

Data sources

Select or add the list of data sources to be scanned by the crawler.

(Optional) If you choose JDBC as the data source, you can use your own JDBC drivers when specifying the connection that stores the driver information.

Include path

When evaluating what to include or exclude in a crawl, a crawler starts by evaluating the required include path. For Amazon S3, MongoDB, MongoDB Atlas, Amazon DocumentDB (with MongoDB compatibility), and relational data stores, you must specify an include path.

For an Amazon S3 data store

Choose whether to specify a path in this account or in a different account, and then browse to choose an Amazon S3 path.

For Amazon S3 data stores, include path syntax is bucket-name/folder-name/file-name.ext. To crawl all objects in a bucket, you specify just the bucket name in the include path. The exclude pattern is relative to the include path.

For a Delta Lake data store

Specify one or more Amazon S3 paths to Delta tables as s3://bucket/prefix/object.

For an Iceberg or Hudi data store

Specify one or more Amazon S3 paths that contain folders with Iceberg or Hudi table metadata as s3://bucket/prefix.

For a Hudi data store, the Hudi folder may be located in a child folder of the root folder. The crawler will scan all folders underneath a path for a Hudi folder.

For a JDBC data store

Enter <database>/<schema>/<table> or <database>/<table>, depending on the database product. Oracle Database and MySQL don’t support schema in the path. You can substitute the percent (%) character for <schema> or <table>. For example, for an Oracle database with a system identifier (SID) of orcl, enter orcl/% to import all tables to which the user named in the connection has access.

Important

This field is case-sensitive.

For a MongoDB, MongoDB Atlas, or Amazon DocumentDB data store

Enter database/collection.

For MongoDB, MongoDB Atlas, and Amazon DocumentDB (with MongoDB compatibility), the syntax is database/collection.

For JDBC data stores, the syntax is either database-name/schema-name/table-name or database-name/table-name. The syntax depends on whether the database engine supports schemas within a database. For example, for database engines such as MySQL or Oracle, don't specify a schema-name in your include path. You can substitute the percent sign (%) for a schema or table in the include path to represent all schemas or all tables in a database. You cannot substitute the percent sign (%) for database in the include path.

Maximum traversal depth (for Iceberg or Hudi data stores only)

Defines the maximum depth of the Amazon S3 path that the crawler can traverse to discover the Iceberg or Hudi metadata folder in your Amazon S3 path. The purpose of this parameter is to limit the crawler run time. The default value is 10 and the maximum is 20.
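
In the CreateCrawler API, this setting corresponds to the MaximumTraversalDepth field of an Iceberg or Hudi target. A sketch using the Amazon SDK for Python (Boto3), with placeholder names:

import boto3

glue = boto3.client("glue")

# Illustrative only: an Iceberg target with a bounded traversal depth.
# HudiTargets accepts the same shape.
glue.create_crawler(
    Name="iceberg-crawler",
    Role="arn:aws-cn:iam::123456789012:role/AWSGlueServiceRole-example",
    DatabaseName="example_db",
    Targets={
        "IcebergTargets": [
            {
                "Paths": ["s3://example-bucket/warehouse/"],
                # Folder levels below the path to search for table metadata
                # (default 10, maximum 20).
                "MaximumTraversalDepth": 10,
            }
        ]
    },
)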

Exclude patterns

These enable you to exclude certain files or tables from the crawl. The exclude path is relative to the include path. For example, to exclude a table in your JDBC data store, type the table name in the exclude path.

A crawler connects to a JDBC data store using an Amazon Glue connection that contains a JDBC URI connection string. The crawler only has access to objects in the database engine using the JDBC user name and password in the Amazon Glue connection. The crawler can only create tables that it can access through the JDBC connection. After the crawler accesses the database engine with the JDBC URI, the include path is used to determine which tables in the database engine are created in the Data Catalog. For example, with MySQL, if you specify an include path of MyDatabase/%, then all tables within MyDatabase are created in the Data Catalog. When accessing Amazon Redshift, if you specify an include path of MyDatabase/%, then all tables within all schemas for database MyDatabase are created in the Data Catalog. If you specify an include path of MyDatabase/MySchema/%, then all tables in database MyDatabase and schema MySchema are created.

After you specify an include path, you can then exclude objects from the crawl that your include path would otherwise include by specifying one or more Unix-style glob exclude patterns. These patterns are applied to your include path to determine which objects are excluded. These patterns are also stored as a property of tables created by the crawler. Amazon Glue PySpark extensions, such as create_dynamic_frame.from_catalog, read the table properties and exclude objects defined by the exclude pattern.

Amazon Glue supports the following kinds of glob patterns in the exclude pattern.

Exclude pattern Description
*.csv Matches an Amazon S3 path that represents an object name in the current folder ending in .csv
*.* Matches all object names that contain a dot
*.{csv,avro} Matches object names ending with .csv or .avro
foo.? Matches object names starting with foo. that are followed by a single character extension
myfolder/* Matches objects in one level of subfolder from myfolder, such as /myfolder/mysource
myfolder/*/* Matches objects in two levels of subfolders from myfolder, such as /myfolder/mysource/data
myfolder/** Matches objects in all subfolders of myfolder, such as /myfolder/mysource/mydata and /myfolder/mysource/data
myfolder** Matches subfolder myfolder as well as files below myfolder, such as /myfolder and /myfolder/mydata.txt
Market* Matches tables in a JDBC database with names that begin with Market, such as Market_us and Market_fr

Amazon Glue interprets glob exclude patterns as follows:

  • The slash (/) character is the delimiter to separate Amazon S3 keys into a folder hierarchy.

  • The asterisk (*) character matches zero or more characters of a name component without crossing folder boundaries.

  • A double asterisk (**) matches zero or more characters crossing folder or schema boundaries.

  • The question mark (?) character matches exactly one character of a name component.

  • The backslash (\) character is used to escape characters that otherwise can be interpreted as special characters. The expression \\ matches a single backslash, and \{ matches a left brace.

  • Brackets [ ] create a bracket expression that matches a single character of a name component out of a set of characters. For example, [abc] matches a, b, or c. The hyphen (-) can be used to specify a range, so [a-z] specifies a range that matches from a through z (inclusive). These forms can be mixed, so [abce-g] matches a, b, c, e, f, or g. If the character after the bracket ([) is an exclamation point (!), the bracket expression is negated. For example, [!a-c] matches any character except a, b, or c.

    Within a bracket expression, the *, ?, and \ characters match themselves. The hyphen (-) character matches itself if it is the first character within the brackets, or if it's the first character after the ! when you are negating.

  • Braces ({ }) enclose a group of subpatterns, where the group matches if any subpattern in the group matches. A comma (,) character is used to separate the subpatterns. Groups cannot be nested.

  • Leading period or dot characters in file names are treated as normal characters in match operations. For example, the * exclude pattern matches the file name .hidden.

Example Amazon S3 exclude patterns

Each exclude pattern is evaluated against the include path. For example, suppose that you have the following Amazon S3 directory structure:

/mybucket/myfolder/
    departments/
        finance.json
        market-us.json
        market-emea.json
        market-ap.json
    employees/
        hr.json
        john.csv
        jane.csv
        juan.txt

Given the include path s3://mybucket/myfolder/, the following are some sample results for exclude patterns:

Exclude pattern Results
departments/** Excludes all files and folders below departments and includes the employees folder and its files
departments/market* Excludes market-us.json, market-emea.json, and market-ap.json
**.csv Excludes all objects below myfolder that have a name ending with .csv
employees/*.csv Excludes all .csv files in the employees folder
Example Excluding a subset of Amazon S3 partitions

Suppose that your data is partitioned by day, so that each day in a year is in a separate Amazon S3 partition. For January 2015, there are 31 partitions. Now, to crawl data for only the first week of January, you must exclude all partitions except days 1 through 7:

2015/01/{[!0],0[8-9]}**, 2015/0[2-9]/**, 2015/1[0-2]/**

Take a look at the parts of this glob pattern. The first part, 2015/01/{[!0],0[8-9]}**, excludes all days that don't begin with a "0" in addition to day 08 and day 09 from month 01 in year 2015. Notice that "**" is used as the suffix to the day number pattern and crosses folder boundaries to lower-level folders. If "*" is used, lower folder levels are not excluded.

The second part, 2015/0[2-9]/**, excludes days in months 02 to 09, in year 2015.

The third part, 2015/1[0-2]/**, excludes days in months 10, 11, and 12, in year 2015.
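
In the CreateCrawler API, these patterns go in the Exclusions list of the corresponding target. A sketch using the Amazon SDK for Python (Boto3), with a placeholder bucket and role:

import boto3

glue = boto3.client("glue")

# Illustrative only: exclude all January 2015 partitions except days 01-07.
# Exclusions are evaluated relative to the include path.
glue.create_crawler(
    Name="january-week1-crawler",
    Role="arn:aws-cn:iam::123456789012:role/AWSGlueServiceRole-example",
    DatabaseName="example_db",
    Targets={
        "S3Targets": [
            {
                "Path": "s3://example-bucket/sales/",
                "Exclusions": [
                    "2015/01/{[!0],0[8-9]}**",
                    "2015/0[2-9]/**",
                    "2015/1[0-2]/**",
                ],
            }
        ]
    },
)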

Example JDBC exclude patterns

Suppose that you are crawling a JDBC database with the following schema structure:

MyDatabase/MySchema/
    HR_us
    HR_fr
    Employees_Table
    Finance
    Market_US_Table
    Market_EMEA_Table
    Market_AP_Table

Given the include path MyDatabase/MySchema/%, the following are some sample results for exclude patterns:

Exclude pattern Results
HR* Excludes the tables with names that begin with HR
Market_* Excludes the tables with names that begin with Market_
**_Table Excludes all tables with names that end with _Table
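
The same Exclusions field applies to JDBC targets. A sketch using the Amazon SDK for Python (Boto3); the connection name, role, and database are placeholders:

import boto3

glue = boto3.client("glue")

# Illustrative only: crawl MySchema but skip the HR and Market tables.
glue.create_crawler(
    Name="jdbc-crawler",
    Role="arn:aws-cn:iam::123456789012:role/AWSGlueServiceRole-example",
    DatabaseName="example_db",
    Targets={
        "JdbcTargets": [
            {
                "ConnectionName": "my-jdbc-connection",
                "Path": "MyDatabase/MySchema/%",
                "Exclusions": ["HR*", "Market_*"],
            }
        ]
    },
)
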
Additional crawler source parameters

Each source type requires a different set of additional parameters. The following is an incomplete list:

Connection

Select or add an Amazon Glue connection. For information about connections, see Connecting to data.

Additional metadata - optional (for JDBC data stores)

Select additional metadata properties for the crawler to crawl.

  • Comments: Crawl associated table level and column level comments.

  • Raw types: Persist the raw datatypes of the table columns in additional metadata. As a default behavior, the crawler translates the raw datatypes to Hive-compatible types.
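
In the CreateCrawler API, these options correspond to the EnableAdditionalMetadata field of a JDBC target. A minimal sketch of the target structure (the connection name and path are placeholders):

# Illustrative only: ask the crawler to also capture comments and raw
# column types for a JDBC target.
jdbc_target = {
    "ConnectionName": "my-jdbc-connection",
    "Path": "MyDatabase/MySchema/%",
    "EnableAdditionalMetadata": ["COMMENTS", "RAWTYPES"],
}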

JDBC Driver Class name - optional (for JDBC data stores)

Type a custom JDBC driver class name for the crawler to connect to the data source:

  • Postgres: org.postgresql.Driver

  • MySQL: com.mysql.jdbc.Driver, com.mysql.cj.jdbc.Driver

  • Redshift: com.amazon.redshift.jdbc.Driver, com.amazon.redshift.jdbc42.Driver

  • Oracle: oracle.jdbc.driver.OracleDriver

  • SQL Server: com.microsoft.sqlserver.jdbc.SQLServerDriver

JDBC Driver S3 Path - optional (for JDBC data stores)

Choose an existing Amazon S3 path to a .jar file. This is where the .jar file will be stored when using a custom JDBC driver for the crawler to connect to the data source.

Enable data sampling (for Amazon DynamoDB, MongoDB, MongoDB Atlas, and Amazon DocumentDB data stores only)

Select whether to crawl a data sample only. If this option is not selected, the entire table is crawled. Scanning all the records can take a long time when the table is not a high-throughput table.
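
For MongoDB-compatible sources, this option corresponds to the ScanAll flag of the target in the CreateCrawler API. A minimal sketch of the target structure (the connection name and path are placeholders):

# Illustrative only: crawl a sample of documents instead of the whole
# collection. Set "ScanAll": True to crawl everything.
mongodb_target = {
    "ConnectionName": "my-mongodb-connection",
    "Path": "mydatabase/mycollection",
    "ScanAll": False,
}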

Create tables for querying (for Delta Lake data stores only)

Select how you want to create the Delta Lake tables:

  • Create Native tables: Allow integration with query engines that support querying of the Delta transaction log directly.

  • Create Symlink tables: Create a symlink manifest folder with manifest files partitioned by the partition keys, based on the specified configuration parameters.
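
These two choices map to fields of the DeltaTargets structure in the CreateCrawler API. A minimal sketch of the target structure (the Amazon S3 path is a placeholder):

# Illustrative only: register Delta tables as native tables. For symlink
# tables, set "WriteManifest": True and "CreateNativeDeltaTable": False.
delta_target = {
    "DeltaTables": ["s3://example-bucket/delta/sales/"],
    "CreateNativeDeltaTable": True,
    "WriteManifest": False,
}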

Scanning rate - optional (for DynamoDB data stores only)

Specify the percentage of the DynamoDB table Read Capacity Units to use by the crawler. Read Capacity Units is a term defined by DynamoDB, and is a numeric value that acts as a rate limiter for the number of reads that can be performed on that table per second. Enter a value between 0.1 and 1.5. If not specified, the value defaults to 0.5% for provisioned tables and to 1/4 of the maximum configured capacity for on-demand tables. Note that only provisioned capacity mode should be used with Amazon Glue crawlers.

Note

For DynamoDB data stores, set the provisioned capacity mode for processing reads and writes on your tables. The Amazon Glue crawler should not be used with the on-demand capacity mode.
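
In the CreateCrawler API, the sampling and scanning-rate options map to fields of the DynamoDB target. A minimal sketch of the target structure (the table name is a placeholder; note the lowercase field names used by this structure):

# Illustrative only: crawl a sample of items and cap the crawler at
# 0.5 percent of the table's Read Capacity Units.
dynamodb_target = {
    "Path": "my-dynamodb-table",
    "scanAll": False,   # crawl a data sample instead of the full table
    "scanRate": 0.5,    # valid values are between 0.1 and 1.5
}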

Network connection - optional (for Amazon S3 data stores only)

Optionally include a Network connection to use with this Amazon S3 target. Note that each crawler is limited to one Network connection so any other Amazon S3 targets will also use the same connection (or none, if left blank).

For information about connections, see Connecting to data.

Sample only a subset of files and Sample size (for Amazon S3 data stores only)

Specify the number of files in each leaf folder to be crawled when crawling sample files in a dataset. When this feature is turned on, instead of crawling all the files in this dataset, the crawler randomly selects some files in each leaf folder to crawl.

The sampling crawler is best suited for customers who have previous knowledge about their data formats and know that schemas in their folders do not change. Turning on this feature will significantly reduce crawler runtime.

A valid value is an integer between 1 and 249. If not specified, all the files are crawled.
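
This option corresponds to the SampleSize field of an Amazon S3 target in the CreateCrawler API. A minimal sketch of the target structure (the path is a placeholder):

# Illustrative only: crawl at most 5 files in each leaf folder instead of
# every object under the include path.
s3_target = {
    "Path": "s3://example-bucket/data/",
    "SampleSize": 5,   # integer between 1 and 249
}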

Subsequent crawler runs

This field is a global field that affects all Amazon S3 data sources.

  • Crawl all sub-folders: Crawl all folders again with every subsequent crawl.

  • Crawl new sub-folders only: Only Amazon S3 folders that were added since the last crawl will be crawled. If the schemas are compatible, new partitions will be added to existing tables. For more information, see Incremental crawls for adding new partitions.

  • Crawl based on events: Rely on Amazon S3 events to control what folders to crawl. For more information, see Accelerating crawls using Amazon S3 event notifications.
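
These three choices correspond to the RecrawlPolicy field of the CreateCrawler API. A minimal sketch:

# Illustrative only: crawl only Amazon S3 folders added since the last crawl.
# Other values: "CRAWL_EVERYTHING" (crawl all sub-folders) and
# "CRAWL_EVENT_MODE" (crawl based on Amazon S3 events).
recrawl_policy = {"RecrawlBehavior": "CRAWL_NEW_FOLDERS_ONLY"}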

Custom classifiers - optional

Define custom classifiers before defining crawlers. A classifier checks whether a given file is in a format the crawler can handle. If it is, the classifier creates a schema in the form of a StructType object that matches that data format.

For more information, see Adding classifiers to a crawler in Amazon Glue.

Step 3: Configure security settings

IAM role

The crawler assumes this role. It must have permissions similar to the Amazon managed policy AWSGlueServiceRole. For Amazon S3 and DynamoDB sources, it must also have permissions to access the data store. If the crawler reads Amazon S3 data encrypted with Amazon Key Management Service (Amazon KMS), then the role must have decrypt permissions on the Amazon KMS key.

For an Amazon S3 data store, additional permissions attached to the role would be similar to the following:

{ "Version": "2012-10-17", "Statement": [ { "Effect": "Allow", "Action": [ "s3:GetObject", "s3:PutObject" ], "Resource": [ "arn:aws-cn:s3:::bucket/object*" ] } ] }

For an Amazon DynamoDB data store, additional permissions attached to the role would be similar to the following:

{ "Version": "2012-10-17", "Statement": [ { "Effect": "Allow", "Action": [ "dynamodb:DescribeTable", "dynamodb:Scan" ], "Resource": [ "arn:aws-cn:dynamodb:region:account-id:table/table-name*" ] } ] }

To add your own JDBC driver, you need to add the following permissions:

  • Grant permissions for the following job actions: CreateJob, DeleteJob, GetJob, GetJobRun, StartJobRun.

  • Grant permissions for the following Amazon S3 actions: s3:DeleteObject, s3:GetObject, s3:ListBucket, s3:PutObject.

    Note

    The s3:ListBucket permission is not needed if the Amazon S3 bucket policy is disabled.

  • Grant service principal access to bucket/folder in the Amazon S3 policy.

Example Amazon S3 policy:

{ "Version": "2012-10-17", "Statement": [ { "Sid": "VisualEditor0", "Effect": "Allow", "Action": [ "s3:PutObject", "s3:GetObject", "s3:ListBucket", "s3:DeleteObject" ], "Resource": [ "arn:aws:s3:::bucket-name/driver-parent-folder/driver.jar", "arn:aws:s3:::bucket-name" ] } ] }

Amazon Glue creates the following folders (_crawler and _glue_job_crawler) at the same level as the JDBC driver in your Amazon S3 bucket. For example, if the driver path is <s3-path/driver_folder/driver.jar>, then the following folders will be created if they do not already exist:

  • <s3-path/driver_folder/_crawler>

  • <s3-path/driver_folder/_glue_job_crawler>

Optionally, you can add a security configuration to a crawler to specify at-rest encryption options.

For more information, see Step 2: Create an IAM role for Amazon Glue and Identity and access management for Amazon Glue.

Lake Formation configuration - optional

Allow the crawler to use Lake Formation credentials for crawling the data source.

Selecting Use Lake Formation credentials for crawling S3 data source allows the crawler to use Lake Formation credentials for crawling the data source. If the data source belongs to another account, you must provide the registered account ID. Otherwise, the crawler will crawl only those data sources associated with the account. This setting is applicable only to Amazon S3 and Data Catalog data sources.
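
This setting corresponds to the LakeFormationConfiguration field of the CreateCrawler API. A minimal sketch (the account ID is a placeholder):

# Illustrative only: use Lake Formation credentials for the crawl. Provide
# AccountId only when the data source is registered in another account.
lake_formation_configuration = {
    "UseLakeFormationCredentials": True,
    "AccountId": "123456789012",
}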

Security configuration - optional

Settings include security configurations, which specify at-rest encryption options.

Note

Once a security configuration has been set on a crawler, you can change it, but you cannot remove it. To lower the level of security on a crawler, explicitly set the security feature to DISABLED within your configuration, or create a new crawler.

Step 4: Set output and scheduling

Output configuration

Options include how the crawler should handle detected schema changes, deleted objects in the data store, and more. For more information, see Customizing crawler behavior.

Crawler schedule

You can run a crawler on demand or define a time-based schedule for your crawlers and jobs in Amazon Glue. The definition of these schedules uses the Unix-like cron syntax. For more information, see Scheduling an Amazon Glue crawler.
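
In the CreateCrawler API, the schedule is passed as a cron expression of the form cron(Minutes Hours Day-of-month Month Day-of-week Year). A sketch using the Amazon SDK for Python (Boto3), with placeholder names:

import boto3

glue = boto3.client("glue")

# Illustrative only: run the crawler every day at 12:15 UTC.
glue.create_crawler(
    Name="scheduled-crawler",
    Role="arn:aws-cn:iam::123456789012:role/AWSGlueServiceRole-example",
    DatabaseName="example_db",
    Targets={"S3Targets": [{"Path": "s3://example-bucket/data/"}]},
    Schedule="cron(15 12 * * ? *)",
)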

Step 5: Review and create

Review the crawler settings you configured, and create the crawler.