FindMatches class - Amazon Glue
Services or capabilities described in Amazon Web Services documentation might vary by Region. To see the differences applicable to the China Regions, see Getting Started with Amazon Web Services in China (PDF).

FindMatches class

Identifies matching records in the input DynamicFrame and creates a new DynamicFrame with a unique identifier assigned to each group of matching records.

To import:

from awsglueml.transforms import FindMatches

Methods

apply(frame, transformId, transformation_ctx = "", info = "", stageThreshold = 0, totalThreshold = 0, enforcedMatches = none, computeMatchConfidenceScores = 0)

Identifies matching records in the input DynamicFrame and creates a new DynamicFrame with a unique identifier assigned to each group of matching records.

  • frame – The DynamicFrame to apply the FindMatches transform. Required.

  • transformId – The unique ID associated with the FindMatches transform to apply on records in the DynamicFrame. Required.

  • transformation_ctx – A unique string that is used to identify stats/state information. Optional.

  • info – A string to be associated with errors in the transformation. Optional.

  • stageThreshold – The maximum number of errors that can occur in the transformation before it errors out. Optional. The default is zero.

  • totalThreshold – The maximum number of errors that can occur overall before processing errors out. Optional. The default is zero.

  • enforcedMatches – The DynamicFrame used to enforce matches. Optional. The default is None.

  • computeMatchConfidenceScores – A Boolean value indicating whether to compute a confidence score for each group of matching records. Optional. The default is false.

Returns a new DynamicFrame with a unique identifier assigned to each group of matching records.