REPLACE_WITH_MOST_FREQUENT - Amazon Glue DataBrew
Services or capabilities described in Amazon Web Services documentation might vary by Region. To see the differences applicable to the China Regions, see Getting Started with Amazon Web Services in China (PDF).

REPLACE_WITH_MOST_FREQUENT

Replaces each invalid value in a column with the most frequent column value.

Parameters
  • sourceColumn – The name of an existing column.

  • columnDataType – The data type of the column.

  • advancedDataType – Special data types that are detected by DataBrew in a column that has the data type string. The types that DataBrew can detect within a string column include SSN, Email, Phone Number, Gender, Credit Card, URL, IP Address, DateTime, Currency, ZipCode, Country, Region, State, and City.

Example

{ "RecipeAction": { "Operation": "REPLACE_WITH_MOST_FREQUENT", "Parameters": { "columnDataType": "string", "sourceColumn": "wind_direction" } } }