Sensitive data detection API - Amazon Glue
Services or capabilities described in Amazon Web Services documentation might vary by Region. To see the differences applicable to the China Regions, see Getting Started with Amazon Web Services in China.

Sensitive data detection API

The Sensitive data detection API describes the APIs used to detect sensitive data across the columns and rows of your structured data.

Data types

CustomEntityType structure

An object representing a custom pattern for detecting sensitive data across the columns and rows of your structured data.

Fields

  • NameRequired: UTF-8 string, not less than 1 or more than 255 bytes long, matching the Single-line string pattern.

    A name for the custom pattern that allows it to be retrieved or deleted later. This name must be unique per Amazon account.

  • RegexStringRequired: UTF-8 string, not less than 1 or more than 255 bytes long, matching the Single-line string pattern.

    A regular expression string that is used for detecting sensitive data in a custom pattern.

  • ContextWords – An array of UTF-8 strings, not less than 1 or more than 20 strings.

    A list of context words. If none of these context words are found within the vicinity of the regular expression the data will not be detected as sensitive data.

    If no context words are passed only a regular expression is checked.

Operations

CreateCustomEntityType action (Python: create_custom_entity_type)

Creates a custom pattern that is used to detect sensitive data across the columns and rows of your structured data.

Each custom pattern you create specifies a regular expression and an optional list of context words. If no context words are passed only a regular expression is checked.

Request

  • NameRequired: UTF-8 string, not less than 1 or more than 255 bytes long, matching the Single-line string pattern.

    A name for the custom pattern that allows it to be retrieved or deleted later. This name must be unique per Amazon account.

  • RegexStringRequired: UTF-8 string, not less than 1 or more than 255 bytes long, matching the Single-line string pattern.

    A regular expression string that is used for detecting sensitive data in a custom pattern.

  • ContextWords – An array of UTF-8 strings, not less than 1 or more than 20 strings.

    A list of context words. If none of these context words are found within the vicinity of the regular expression the data will not be detected as sensitive data.

    If no context words are passed only a regular expression is checked.

Response

  • Name – UTF-8 string, not less than 1 or more than 255 bytes long, matching the Single-line string pattern.

    The name of the custom pattern you created.

Errors

  • AccessDeniedException

  • AlreadyExistsException

  • IdempotentParameterMismatchException

  • InternalServiceException

  • InvalidInputException

  • OperationTimeoutException

  • ResourceNumberLimitExceededException

DeleteCustomEntityType action (Python: delete_custom_entity_type)

Deletes a custom pattern by specifying its name.

Request

  • NameRequired: UTF-8 string, not less than 1 or more than 255 bytes long, matching the Single-line string pattern.

    The name of the custom pattern that you want to delete.

Response

  • Name – UTF-8 string, not less than 1 or more than 255 bytes long, matching the Single-line string pattern.

    The name of the custom pattern you deleted.

Errors

  • EntityNotFoundException

  • AccessDeniedException

  • InternalServiceException

  • InvalidInputException

  • OperationTimeoutException

GetCustomEntityType action (Python: get_custom_entity_type)

Retrieves the details of a custom pattern by specifying its name.

Request

  • NameRequired: UTF-8 string, not less than 1 or more than 255 bytes long, matching the Single-line string pattern.

    The name of the custom pattern that you want to retrieve.

Response

  • Name – UTF-8 string, not less than 1 or more than 255 bytes long, matching the Single-line string pattern.

    The name of the custom pattern that you retrieved.

  • RegexString – UTF-8 string, not less than 1 or more than 255 bytes long, matching the Single-line string pattern.

    A regular expression string that is used for detecting sensitive data in a custom pattern.

  • ContextWords – An array of UTF-8 strings, not less than 1 or more than 20 strings.

    A list of context words if specified when you created the custom pattern. If none of these context words are found within the vicinity of the regular expression the data will not be detected as sensitive data.

Errors

  • EntityNotFoundException

  • AccessDeniedException

  • InternalServiceException

  • InvalidInputException

  • OperationTimeoutException

BatchGetCustomEntityTypes action (Python: batch_get_custom_entity_types)

Retrieves the details for the custom patterns specified by a list of names.

Request

  • NamesRequired: An array of UTF-8 strings, not less than 1 or more than 50 strings.

    A list of names of the custom patterns that you want to retrieve.

Response

  • CustomEntityTypes – An array of CustomEntityType objects.

    A list of CustomEntityType objects representing the custom patterns that have been created.

  • CustomEntityTypesNotFound – An array of UTF-8 strings, not less than 1 or more than 50 strings.

    A list of the names of custom patterns that were not found.

Errors

  • InvalidInputException

  • InternalServiceException

  • OperationTimeoutException

ListCustomEntityTypes action (Python: list_custom_entity_types)

Lists all the custom patterns that have been created.

Request

  • NextToken – UTF-8 string.

    A paginated token to offset the results.

  • MaxResults – Number (integer), not less than 1 or more than 1000.

    The maximum number of results to return.

Response

  • CustomEntityTypes – An array of CustomEntityType objects.

    A list of CustomEntityType objects representing custom patterns.

  • NextToken – UTF-8 string.

    A pagination token, if more results are available.

Errors

  • InvalidInputException

  • OperationTimeoutException

  • InternalServiceException