AWS::Kendra::DataSource S3DataSourceConfiguration
Provides the configuration information to connect to an Amazon S3 bucket.
Note
Amazon Kendra now supports an upgraded Amazon S3 connector.
You must now use the TemplateConfiguration object instead of the S3DataSourceConfiguration object to configure your connector.
Connectors configured using the older console and API architecture will continue to function as configured. However, you won't be able to edit or update them. If you want to edit or update your connector configuration, you must create a new connector.
We recommend migrating your connector workflow to the upgraded version. Support for connectors configured using the older architecture is scheduled to end by June 2024.
Syntax
To declare this entity in your Amazon CloudFormation template, use the following syntax:
JSON
{
  "AccessControlListConfiguration" : AccessControlListConfiguration,
  "BucketName" : String,
  "DocumentsMetadataConfiguration" : DocumentsMetadataConfiguration,
  "ExclusionPatterns" : [ String, ... ],
  "InclusionPatterns" : [ String, ... ],
  "InclusionPrefixes" : [ String, ... ]
}
YAML
  AccessControlListConfiguration:
    AccessControlListConfiguration
  BucketName: String
  DocumentsMetadataConfiguration:
    DocumentsMetadataConfiguration
  ExclusionPatterns:
    - String
  InclusionPatterns:
    - String
  InclusionPrefixes:
    - String
Properties
AccessControlListConfiguration
-
Provides the path to the S3 bucket that contains the user context filtering files for the data source. For the format of the file, see Access control for S3 data sources.
Required: No
Type: AccessControlListConfiguration
Update requires: No interruption
BucketName
-
The name of the bucket that contains the documents.
Required: Yes
Type: String
Pattern: [a-z0-9][\.\-a-z0-9]{1,61}[a-z0-9]
Minimum: 3
Maximum: 63
Update requires: No interruption
DocumentsMetadataConfiguration
-
Specifies document metadata files that contain information such as the document access control information, source URI, document author, and custom attributes. Each metadata file contains metadata about a single document.
Required: No
Type: DocumentsMetadataConfiguration
Update requires: No interruption
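As a minimal sketch of how the three properties above might fit together, the fragment below sets BucketName alongside the two nested objects. The bucket name and object keys are hypothetical placeholders. The KeyPath and S3Prefix property names, and the s3:// form of the key path, are assumptions taken from the AccessControlListConfiguration and DocumentsMetadataConfiguration property types, which this page does not spell out, so check them against those pages before use.

```yaml
# Hypothetical values throughout; replace with your own bucket and keys.
BucketName: example-docs-bucket   # 3 to 63 characters, consistent with the pattern above
AccessControlListConfiguration:
  # Assumed property name and path format; points at the user context filtering file.
  KeyPath: s3://example-docs-bucket/acl/access-control.json
DocumentsMetadataConfiguration:
  # Assumed property name; prefix under which the per-document metadata files live.
  S3Prefix: metadata/
```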
ExclusionPatterns
-
A list of glob patterns (patterns that can expand a wildcard pattern into a list of path names that match the given pattern) for certain file names and file types to exclude from your index. If a document matches both an inclusion and an exclusion prefix or pattern, the exclusion takes precedence and the document is not indexed. Examples of glob patterns include:
- /myapp/config/*: All files inside the config directory.
- **/*.png: All .png files in all directories.
- **/*.{png, ico, md}: All .png, .ico, or .md files in all directories.
- /myapp/src/**/*.ts: All .ts files inside the src directory (and all its subdirectories).
- **/!(*.module).ts: All .ts files except .module.ts files.
- *.png , *.jpg: All PNG and JPEG image files in a directory (files with the extensions .png and .jpg).
- *internal*: All files in a directory that contain 'internal' in the file name, such as 'internal', 'internal_only', 'company_internal'.
- **/*internal*: All internal-related files in a directory and its subdirectories.
For more examples, see Use of Exclude and Include Filters in the Amazon CLI Command Reference.
Required: No
Type: Array of String
Minimum: 1
Maximum: 50 | 100
Update requires: No interruption
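The precedence rule can be illustrated with a small fragment. This is a sketch with hypothetical patterns and bucket name, not a recommended configuration: a document such as docs/internal_notes.md matches both lists, so by the rule described for this property it would not be indexed.

```yaml
# Hypothetical example: index Markdown and PDF files, but skip anything "internal".
BucketName: example-docs-bucket
InclusionPatterns:
  - "**/*.md"
  - "**/*.pdf"
ExclusionPatterns:
  - "**/*internal*"   # exclusion wins when a document matches both lists
```

The patterns are quoted because a YAML scalar that begins with an asterisk is otherwise parsed as an alias reference.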
InclusionPatterns
-
A list of glob patterns (patterns that can expand a wildcard pattern into a list of path names that match the given pattern) for certain file names and file types to include in your index. If a document matches both an inclusion and an exclusion prefix or pattern, the exclusion takes precedence and the document is not indexed. Examples of glob patterns include:
- /myapp/config/*: All files inside the config directory.
- **/*.png: All .png files in all directories.
- **/*.{png, ico, md}: All .png, .ico, or .md files in all directories.
- /myapp/src/**/*.ts: All .ts files inside the src directory (and all its subdirectories).
- **/!(*.module).ts: All .ts files except .module.ts files.
- *.png , *.jpg: All PNG and JPEG image files in a directory (files with the extensions .png and .jpg).
- *internal*: All files in a directory that contain 'internal' in the file name, such as 'internal', 'internal_only', 'company_internal'.
- **/*internal*: All internal-related files in a directory and its subdirectories.
For more examples, see Use of Exclude and Include Filters in the Amazon CLI Command Reference.
Required: No
Type: Array of String
Minimum: 1
Maximum: 50 | 100
Update requires: No interruption
InclusionPrefixes
-
A list of S3 prefixes for the documents that should be included in the index.
Required: No
Type: Array of String
Minimum: 1
Maximum: 50 | 100
Update requires: No interruption
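For context, the sketch below shows where this property type might sit inside a complete resource declaration, assuming it is supplied as the S3Configuration member of the data source's DataSourceConfiguration. The logical IDs ExampleS3DataSource, ExampleIndex, and ExampleDataSourceRole, the data source name, and the bucket and prefix values are all hypothetical; the index and role are assumed to be declared elsewhere in the same template. Because this configuration object belongs to the older connector architecture described in the note at the top of this page, treat it as illustrative only.

```yaml
Resources:
  ExampleS3DataSource:                 # hypothetical logical ID
    Type: AWS::Kendra::DataSource
    Properties:
      IndexId: !Ref ExampleIndex       # hypothetical index resource declared elsewhere
      Name: example-s3-source          # hypothetical data source name
      Type: S3
      RoleArn: !GetAtt ExampleDataSourceRole.Arn   # hypothetical IAM role declared elsewhere
      DataSourceConfiguration:
        S3Configuration:               # assumed parent property for this type
          BucketName: example-docs-bucket
          InclusionPrefixes:           # only documents under these S3 prefixes are considered
            - manuals/
            - policies/2023/
```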