Connecting to data with Amazon Glue DataBrew
In Amazon Glue DataBrew, a dataset represents data that's either uploaded from a file or stored elsewhere. For example, data can be stored in Amazon S3, in a supported JDBC data source, or an Amazon Glue Data Catalog. If you're not uploading a file directly to DataBrew, the dataset also contains details on how DataBrew can connect to the data.
When you create your dataset (for example, inventory-dataset
), you enter the
connection details only once. From that point, DataBrew can access the underlying
data for you. With this approach, you can create projects and develop transformations
for your data, without having to worry about connection details or file formats.