Parameters set on Data Catalog tables by crawler - Amazon Glue
Services or capabilities described in Amazon Web Services documentation might vary by Region. To see the differences applicable to the China Regions, see Getting Started with Amazon Web Services in China.

Parameters set on Data Catalog tables by crawler

These table properties are set by Amazon Glue crawlers. We expect users to consume the classification and compressionType properties. Other properties, including table size estimates, are used for internal calculations, and we do not guarantee their accuracy or applicability to customer use cases. Changing these parameters may alter the behavior of the crawler, we do not support this workflow.

Property key Property value
UPDATED_BY_CRAWLER

Name of crawler performing update.

recordCount

Estimate count of records in table, based on file sizes and headers.

skip.header.line.count

Rows skipped to skip header. Set on tables classified as CSV.

CrawlerSchemaSerializerVersion

For internal use

classification

Format of data, inferred by crawler. For more information about data formats supported by Amazon Glue crawlers see Built-in classifiers in Amazon Glue.

CrawlerSchemaDeserializerVersion

For internal use

sizeKey

Combined size of files in table crawled.

averageRecordSize

Average size of row in table, in bytes.

compressionType

Type of compression used on data in the table. For more information about compression types supported by Amazon Glue crawlers see Built-in classifiers in Amazon Glue.

typeOfData

file, table or view.

objectCount

Number of objects under Amazon S3 path for table.