AWS::Comprehend::DocumentClassifier AugmentedManifestsListItem
An augmented manifest file that provides training data for your custom model. An augmented manifest file is a labeled dataset that is produced by Amazon SageMaker Ground Truth.
Syntax
To declare this entity in your Amazon CloudFormation template, use the following syntax:
JSON
{ "AttributeNames" :
[ String, ... ]
, "S3Uri" :String
, "Split" :String
}
YAML
AttributeNames:
- String
S3Uri:String
Split:String
Properties
AttributeNames
-
The JSON attribute that contains the annotations for your training documents. The number of attribute names that you specify depends on whether your augmented manifest file is the output of a single labeling job or a chained labeling job.
If your file is the output of a single labeling job, specify the LabelAttributeName key that was used when the job was created in Ground Truth.
If your file is the output of a chained labeling job, specify the LabelAttributeName key for one or more jobs in the chain. Each LabelAttributeName key provides the annotations from an individual job.
Required: Yes
Type: Array of String
Minimum:
1
Maximum:
63
Update requires: Replacement
S3Uri
-
The Amazon S3 location of the augmented manifest file.
Required: Yes
Type: String
Pattern:
s3://[a-z0-9][\.\-a-z0-9]{1,61}[a-z0-9](/.*)?
Maximum:
1024
Update requires: Replacement
Split
-
The purpose of the data you've provided in the augmented manifest. You can either train or test this data. If you don't specify, the default is train.
TRAIN - all of the documents in the manifest will be used for training. If no test documents are provided, Amazon Comprehend will automatically reserve a portion of the training documents for testing.
TEST - all of the documents in the manifest will be used for testing.
Required: No
Type: String
Allowed values:
TRAIN | TEST
Update requires: Replacement