AWS::KinesisFirehose::DeliveryStream ParquetSerDe - Amazon CloudFormation
Services or capabilities described in Amazon Web Services documentation might vary by Region. To see the differences applicable to the China Regions, see Getting Started with Amazon Web Services in China (PDF).

AWS::KinesisFirehose::DeliveryStream ParquetSerDe

A serializer to use for converting data to the Parquet format before storing it in Amazon S3. For more information, see Apache Parquet.

Syntax

To declare this entity in your Amazon CloudFormation template, use the following syntax:

JSON

{ "BlockSizeBytes" : Integer, "Compression" : String, "EnableDictionaryCompression" : Boolean, "MaxPaddingBytes" : Integer, "PageSizeBytes" : Integer, "WriterVersion" : String }

YAML

BlockSizeBytes: Integer Compression: String EnableDictionaryCompression: Boolean MaxPaddingBytes: Integer PageSizeBytes: Integer WriterVersion: String

Properties

BlockSizeBytes

The Hadoop Distributed File System (HDFS) block size. This is useful if you intend to copy the data from Amazon S3 to HDFS before querying. The default is 256 MiB and the minimum is 64 MiB. Firehose uses this value for padding calculations.

Required: No

Type: Integer

Minimum: 67108864

Update requires: No interruption

Compression

The compression code to use over data blocks. The possible values are UNCOMPRESSED, SNAPPY, and GZIP, with the default being SNAPPY. Use SNAPPY for higher decompression speed. Use GZIP if the compression ratio is more important than speed.

Required: No

Type: String

Allowed values: UNCOMPRESSED | GZIP | SNAPPY

Update requires: No interruption

EnableDictionaryCompression

Indicates whether to enable dictionary compression.

Required: No

Type: Boolean

Update requires: No interruption

MaxPaddingBytes

The maximum amount of padding to apply. This is useful if you intend to copy the data from Amazon S3 to HDFS before querying. The default is 0.

Required: No

Type: Integer

Minimum: 0

Update requires: No interruption

PageSizeBytes

The Parquet page size. Column chunks are divided into pages. A page is conceptually an indivisible unit (in terms of compression and encoding). The minimum value is 64 KiB and the default is 1 MiB.

Required: No

Type: Integer

Minimum: 65536

Update requires: No interruption

WriterVersion

Indicates the version of row format to output. The possible values are V1 and V2. The default is V1.

Required: No

Type: String

Allowed values: V1 | V2

Update requires: No interruption