Product and service integrations
Use this section to know which products and services integrate with DataBrew.
DataBrew works with the following Amazon services for networking, management, and governance:
DataBrew works with the following Amazon data lakes and data stores:
DataBrew supports the following file formats and extensions for uploading data.
| Format | File extension (optional) | Extensions for compressed files (required) |
|---|---|---|
|
Comma-separated values |
|
|
| Microsoft Excel workbook |
|
No compression support |
|
JSON (JSON document and JSON lines) |
|
|
| Apache ORC |
|
|
| Apache Parquet |
|
|
DataBrew writes output files to Amazon S3, and supports the following file formats and extensions.
| Format | File extension (uncompressed) | File extensions (compressed) |
|---|---|---|
|
Comma-separated values |
.csv |
.csv.snappy, .csv.gz,
.csv.lz4, csv.bz2,
.csv.deflate, csv.br |
|
Tab-separated values |
.csv |
.tsv.snappy, .tsv.gz,
.tsv.lz4, tsv.bz2,
.tsv.deflate, tsv.br |
| Apache Parquet | .parquet |
.parquet.snappy, .parquet.gz,
.parquet.lz4, .parquet.lzo,
.parquet.br |
| Amazon Glue Parquet | Not supported | .glue.parquet.snappy |
| Apache Avro | .avro |
.avro.snappy, .avro.gz,
.avro.lz4, .avro.bz2,
.avro.deflate, .avro.br |
| Apache ORC | .orc |
.orc.snappy, .orc.lzo,
.orc.zlib |
| XML | .xml |
.xml.snappy, .xml.gz,
.xml.lz4, .xml.bz2,
.xml.deflate, .xml.br |
| JSON (JSON Lines format only) |
.json
|
.json.snappy, .json.gz,
.json.lz4, json.bz2,
.json.deflate, .json.br |
| Tableau Hyper | Not supported | Not applicable |