Product and service integrations - Amazon Glue DataBrew
Services or capabilities described in Amazon Web Services documentation might vary by Region. To see the differences applicable to the China Regions, see Getting Started with Amazon Web Services in China (PDF).

Product and service integrations

Use this section to know which products and services integrate with DataBrew.

DataBrew works with the following Amazon services for networking, management, and governance:

DataBrew works with the following Amazon data lakes and data stores:

DataBrew supports the following file formats and extensions for uploading data.

Format File extension (optional) Extensions for compressed files (required)

Comma-separated values

.csv

.gz

.snappy

.lz4

.bz2

.deflate

Microsoft Excel workbook

.xlsx

No compression support

JSON (JSON document and JSON lines)

.json, .jsonl

.gz

.snappy

.lz4

.bz2

.deflate

Apache ORC

.orc

.zlib

.snappy

Apache Parquet

.parquet

.gz

.snappy

.lz4

DataBrew writes output files to Amazon S3, and supports the following file formats and extensions.

Format File extension (uncompressed) File extensions (compressed)

Comma-separated values

.csv .csv.snappy, .csv.gz, .csv.lz4, csv.bz2, .csv.deflate, csv.br

Tab-separated values

.csv .tsv.snappy, .tsv.gz, .tsv.lz4, tsv.bz2, .tsv.deflate, tsv.br
Apache Parquet .parquet .parquet.snappy, .parquet.gz, .parquet.lz4, .parquet.lzo, .parquet.br
Amazon Glue Parquet Not supported .glue.parquet.snappy
Apache Avro .avro .avro.snappy, .avro.gz, .avro.lz4, .avro.bz2, .avro.deflate, .avro.br
Apache ORC .orc .orc.snappy, .orc.lzo, .orc.zlib
XML .xml .xml.snappy, .xml.gz, .xml.lz4, .xml.bz2, .xml.deflate, .xml.br
JSON (JSON Lines format only) .json .json.snappy, .json.gz, .json.lz4, json.bz2, .json.deflate, .json.br
Tableau Hyper Not supported Not applicable