AWS::Kendra::DataSource S3DataSourceConfiguration

Provides the configuration information to connect to an Amazon S3 bucket.

Note

Amazon Kendra now supports an upgraded Amazon S3 connector.

You must now use the TemplateConfiguration object instead of the S3DataSourceConfiguration object to configure your connector.

Connectors configured using the older console and API architecture will continue to function as configured. However, you won't be able to edit or update them. If you want to edit or update your connector configuration, you must create a new connector.

We recommend migrating your connector workflow to the upgraded version. Support for connectors configured using the older architecture is scheduled to end by June 2024.

Syntax

To declare this entity in your Amazon CloudFormation template, use the following syntax:
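
A YAML sketch of the property layout, assembled from the properties documented in the Properties section. The right-hand sides are type placeholders, not literal values:

```yaml
# Each placeholder names the expected type of the property;
# replace it with a real value in your template.
AccessControlListConfiguration:
  AccessControlListConfiguration
BucketName: String
DocumentsMetadataConfiguration:
  DocumentsMetadataConfiguration
ExclusionPatterns:
  - String
InclusionPatterns:
  - String
InclusionPrefixes:
  - String
```

Only BucketName is required; the other properties can be omitted.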

Properties

AccessControlListConfiguration

Provides the path to the S3 bucket that contains the user context filtering files for the data source. For the format of the file, see Access control for S3 data sources.

Required: No

Type: AccessControlListConfiguration

Update requires: No interruption
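
As an illustration, the fragment below points the connector at an access control file. It assumes that the AccessControlListConfiguration type exposes a single KeyPath property holding the S3 path. The bucket name and object key are hypothetical.

```yaml
AccessControlListConfiguration:
  # Assumed property name (KeyPath) and hypothetical S3 path to the
  # file that holds the user context filtering rules.
  KeyPath: s3://amzn-s3-demo-bucket/access-control/acl.json
```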

BucketName

The name of the bucket that contains the documents.

Required: Yes

Type: String

Pattern: [a-z0-9][\.\-a-z0-9]{1,61}[a-z0-9]

Minimum: 3

Maximum: 63

Update requires: No interruption
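
To show where BucketName sits in a template, here is a minimal sketch of a data source resource. The logical IDs (DocsDataSource, DocsIndex, DataSourceRole), the data source name, and the bucket name are hypothetical, and the index and IAM role are assumed to be defined elsewhere in the same template.

```yaml
Resources:
  DocsDataSource:                         # hypothetical logical ID
    Type: AWS::Kendra::DataSource
    Properties:
      IndexId: !Ref DocsIndex             # assumes an AWS::Kendra::Index named DocsIndex
      Name: docs-s3-source                # hypothetical data source name
      Type: S3
      RoleArn: !GetAtt DataSourceRole.Arn # assumes an IAM role named DataSourceRole
      DataSourceConfiguration:
        S3Configuration:
          # 3 to 63 characters: lowercase letters, digits, dots, and hyphens,
          # starting and ending with a letter or digit.
          BucketName: amzn-s3-demo-bucket
```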

DocumentsMetadataConfiguration

Specifies document metadata files that contain information such as the document access control information, source URI, document author, and custom attributes. Each metadata file contains metadata about a single document.

Required: No

Type: DocumentsMetadataConfiguration

Update requires: No interruption
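
A short sketch of this property, assuming the DocumentsMetadataConfiguration type exposes an S3Prefix property that locates the metadata files. The prefix value is hypothetical.

```yaml
DocumentsMetadataConfiguration:
  # Assumed property name (S3Prefix) and hypothetical prefix under which
  # the per-document metadata files are stored.
  S3Prefix: metadata/
```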

ExclusionPatterns

A list of glob patterns (wildcard patterns that expand into the list of path names they match) for certain file names and file types to exclude from your index. If a document matches both an inclusion and an exclusion prefix or pattern, the exclusion takes precedence and the document is not indexed. Examples of glob patterns include:

  • /myapp/config/*—All files inside config directory.

  • **/*.png—All .png files in all directories.

  • **/*.{png, ico, md}—All .png, .ico or .md files in all directories.

  • /myapp/src/**/*.ts—All .ts files inside src directory (and all its subdirectories).

  • **/!(*.module).ts—All .ts files but not .module.ts

  • *.png , *.jpg—All PNG and JPEG image files in a directory (files with the extensions .png and .jpg).

  • *internal*—All files in a directory that contain 'internal' in the file name, such as 'internal', 'internal_only', 'company_internal'.

  • **/*internal*—All internal-related files in a directory and its subdirectories.

For more examples, see Use of Exclude and Include Filters in the Amazon CLI Command Reference.

Required: No

Type: Array of String

Minimum: 1

Maximum: 50 | 100

Update requires: No interruption
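
In a YAML template, exclusion patterns could be written as in the sketch below, which reuses patterns from the examples above. The quoting is the point of interest: a plain YAML scalar cannot begin with "*", because YAML reserves that character for aliases, so patterns that start with a wildcard must be quoted.

```yaml
ExclusionPatterns:
  # Quote patterns that start with "*" so YAML does not read them as aliases.
  - "**/*.png"        # all .png files in all directories
  - "**/*internal*"   # internal-related files in a directory and its subdirectories
  - "/myapp/config/*" # all files inside the config directory
```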

InclusionPatterns

A list of glob patterns (wildcard patterns that expand into the list of path names they match) for certain file names and file types to include in your index. If a document matches both an inclusion and an exclusion prefix or pattern, the exclusion takes precedence and the document is not indexed. Examples of glob patterns include:

  • /myapp/config/*—All files inside config directory.

  • **/*.png—All .png files in all directories.

  • **/*.{png, ico, md}—All .png, .ico or .md files in all directories.

  • /myapp/src/**/*.ts—All .ts files inside src directory (and all its subdirectories).

  • **/!(*.module).ts—All .ts files but not .module.ts

  • *.png , *.jpg—All PNG and JPEG image files in a directory (files with the extensions .png and .jpg).

  • *internal*—All files in a directory that contain 'internal' in the file name, such as 'internal', 'internal_only', 'company_internal'.

  • **/*internal*—All internal-related files in a directory and its subdirectories.

For more examples, see Use of Exclude and Include Filters in the Amazon CLI Command Reference.

Required: No

Type: Array of String

Minimum: 1

Maximum: 50 | 100

Update requires: No interruption
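
The precedence rule stated above can be illustrated with a sketch that combines both lists. The patterns are taken from the examples, with .md substituted for the file extension, and the file name in the comment is hypothetical.

```yaml
InclusionPatterns:
  - "**/*.md"         # include Markdown files in all directories
ExclusionPatterns:
  - "**/*internal*"   # exclude anything with "internal" in the file name
# A hypothetical object such as docs/internal_notes.md matches both lists,
# so the exclusion wins and the document is not indexed.
```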

InclusionPrefixes

A list of S3 prefixes for the documents that should be included in the index.

Required: No

Type: Array of String

Minimum: 1

Maximum: 50 | 100

Update requires: No interruption
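
A brief sketch of this property with hypothetical key prefixes:

```yaml
InclusionPrefixes:
  # Hypothetical prefixes; documents stored under them are included in the index.
  - manuals/
  - release-notes/2024/
```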