/**
* Copyright Amazon.com, Inc. or its affiliates. All Rights Reserved.
* SPDX-License-Identifier: Apache-2.0.
*/
#pragma once
#include Specifies the format and location of the input data.See Also:
* AWS
* API Reference
The format of your training data:
* COMPREHEND_CSV
: A CSV file that supplements your training
* documents. The CSV file contains information about the custom entities that your
* trained model will detect. The required format of the file depends on whether
* you are providing annotations or an entity list.
If you use this value,
* you must provide your CSV file by using either the Annotations
or
* EntityList
parameters. You must provide your training documents by
* using the Documents
parameter.
* AUGMENTED_MANIFEST
: A labeled dataset that is produced by Amazon
* SageMaker Ground Truth. This file is in JSON lines format. Each line is a
* complete JSON object that contains a training document and its labels. Each
* label annotates a named entity in the training document.
If you use this
* value, you must provide the AugmentedManifests
parameter in your
* request.
If you don't specify a value, Amazon Comprehend uses
* COMPREHEND_CSV
as the default.
The format of your training data:
* COMPREHEND_CSV
: A CSV file that supplements your training
* documents. The CSV file contains information about the custom entities that your
* trained model will detect. The required format of the file depends on whether
* you are providing annotations or an entity list.
If you use this value,
* you must provide your CSV file by using either the Annotations
or
* EntityList
parameters. You must provide your training documents by
* using the Documents
parameter.
* AUGMENTED_MANIFEST
: A labeled dataset that is produced by Amazon
* SageMaker Ground Truth. This file is in JSON lines format. Each line is a
* complete JSON object that contains a training document and its labels. Each
* label annotates a named entity in the training document.
If you use this
* value, you must provide the AugmentedManifests
parameter in your
* request.
If you don't specify a value, Amazon Comprehend uses
* COMPREHEND_CSV
as the default.
The format of your training data:
* COMPREHEND_CSV
: A CSV file that supplements your training
* documents. The CSV file contains information about the custom entities that your
* trained model will detect. The required format of the file depends on whether
* you are providing annotations or an entity list.
If you use this value,
* you must provide your CSV file by using either the Annotations
or
* EntityList
parameters. You must provide your training documents by
* using the Documents
parameter.
* AUGMENTED_MANIFEST
: A labeled dataset that is produced by Amazon
* SageMaker Ground Truth. This file is in JSON lines format. Each line is a
* complete JSON object that contains a training document and its labels. Each
* label annotates a named entity in the training document.
If you use this
* value, you must provide the AugmentedManifests
parameter in your
* request.
If you don't specify a value, Amazon Comprehend uses
* COMPREHEND_CSV
as the default.
The format of your training data:
* COMPREHEND_CSV
: A CSV file that supplements your training
* documents. The CSV file contains information about the custom entities that your
* trained model will detect. The required format of the file depends on whether
* you are providing annotations or an entity list.
If you use this value,
* you must provide your CSV file by using either the Annotations
or
* EntityList
parameters. You must provide your training documents by
* using the Documents
parameter.
* AUGMENTED_MANIFEST
: A labeled dataset that is produced by Amazon
* SageMaker Ground Truth. This file is in JSON lines format. Each line is a
* complete JSON object that contains a training document and its labels. Each
* label annotates a named entity in the training document.
If you use this
* value, you must provide the AugmentedManifests
parameter in your
* request.
If you don't specify a value, Amazon Comprehend uses
* COMPREHEND_CSV
as the default.
The format of your training data:
* COMPREHEND_CSV
: A CSV file that supplements your training
* documents. The CSV file contains information about the custom entities that your
* trained model will detect. The required format of the file depends on whether
* you are providing annotations or an entity list.
If you use this value,
* you must provide your CSV file by using either the Annotations
or
* EntityList
parameters. You must provide your training documents by
* using the Documents
parameter.
* AUGMENTED_MANIFEST
: A labeled dataset that is produced by Amazon
* SageMaker Ground Truth. This file is in JSON lines format. Each line is a
* complete JSON object that contains a training document and its labels. Each
* label annotates a named entity in the training document.
If you use this
* value, you must provide the AugmentedManifests
parameter in your
* request.
If you don't specify a value, Amazon Comprehend uses
* COMPREHEND_CSV
as the default.
The format of your training data:
* COMPREHEND_CSV
: A CSV file that supplements your training
* documents. The CSV file contains information about the custom entities that your
* trained model will detect. The required format of the file depends on whether
* you are providing annotations or an entity list.
If you use this value,
* you must provide your CSV file by using either the Annotations
or
* EntityList
parameters. You must provide your training documents by
* using the Documents
parameter.
* AUGMENTED_MANIFEST
: A labeled dataset that is produced by Amazon
* SageMaker Ground Truth. This file is in JSON lines format. Each line is a
* complete JSON object that contains a training document and its labels. Each
* label annotates a named entity in the training document.
If you use this
* value, you must provide the AugmentedManifests
parameter in your
* request.
If you don't specify a value, Amazon Comprehend uses
* COMPREHEND_CSV
as the default.
The entity types in the labeled training data that Amazon Comprehend uses to * train the custom entity recognizer. Any entity types that you don't specify are * ignored.
A maximum of 25 entity types can be used at one time to train an * entity recognizer. Entity types must not contain the following invalid * characters: \n (line break), \\n (escaped line break), \r (carriage return), \\r * (escaped carriage return), \t (tab), \\t (escaped tab), space, and , (comma). *
*/ inline const Aws::VectorThe entity types in the labeled training data that Amazon Comprehend uses to * train the custom entity recognizer. Any entity types that you don't specify are * ignored.
A maximum of 25 entity types can be used at one time to train an * entity recognizer. Entity types must not contain the following invalid * characters: \n (line break), \\n (escaped line break), \r (carriage return), \\r * (escaped carriage return), \t (tab), \\t (escaped tab), space, and , (comma). *
*/ inline bool EntityTypesHasBeenSet() const { return m_entityTypesHasBeenSet; } /** *The entity types in the labeled training data that Amazon Comprehend uses to * train the custom entity recognizer. Any entity types that you don't specify are * ignored.
A maximum of 25 entity types can be used at one time to train an * entity recognizer. Entity types must not contain the following invalid * characters: \n (line break), \\n (escaped line break), \r (carriage return), \\r * (escaped carriage return), \t (tab), \\t (escaped tab), space, and , (comma). *
*/ inline void SetEntityTypes(const Aws::VectorThe entity types in the labeled training data that Amazon Comprehend uses to * train the custom entity recognizer. Any entity types that you don't specify are * ignored.
A maximum of 25 entity types can be used at one time to train an * entity recognizer. Entity types must not contain the following invalid * characters: \n (line break), \\n (escaped line break), \r (carriage return), \\r * (escaped carriage return), \t (tab), \\t (escaped tab), space, and , (comma). *
*/ inline void SetEntityTypes(Aws::VectorThe entity types in the labeled training data that Amazon Comprehend uses to * train the custom entity recognizer. Any entity types that you don't specify are * ignored.
A maximum of 25 entity types can be used at one time to train an * entity recognizer. Entity types must not contain the following invalid * characters: \n (line break), \\n (escaped line break), \r (carriage return), \\r * (escaped carriage return), \t (tab), \\t (escaped tab), space, and , (comma). *
*/ inline EntityRecognizerInputDataConfig& WithEntityTypes(const Aws::VectorThe entity types in the labeled training data that Amazon Comprehend uses to * train the custom entity recognizer. Any entity types that you don't specify are * ignored.
A maximum of 25 entity types can be used at one time to train an * entity recognizer. Entity types must not contain the following invalid * characters: \n (line break), \\n (escaped line break), \r (carriage return), \\r * (escaped carriage return), \t (tab), \\t (escaped tab), space, and , (comma). *
*/ inline EntityRecognizerInputDataConfig& WithEntityTypes(Aws::VectorThe entity types in the labeled training data that Amazon Comprehend uses to * train the custom entity recognizer. Any entity types that you don't specify are * ignored.
A maximum of 25 entity types can be used at one time to train an * entity recognizer. Entity types must not contain the following invalid * characters: \n (line break), \\n (escaped line break), \r (carriage return), \\r * (escaped carriage return), \t (tab), \\t (escaped tab), space, and , (comma). *
*/ inline EntityRecognizerInputDataConfig& AddEntityTypes(const EntityTypesListItem& value) { m_entityTypesHasBeenSet = true; m_entityTypes.push_back(value); return *this; } /** *The entity types in the labeled training data that Amazon Comprehend uses to * train the custom entity recognizer. Any entity types that you don't specify are * ignored.
A maximum of 25 entity types can be used at one time to train an * entity recognizer. Entity types must not contain the following invalid * characters: \n (line break), \\n (escaped line break), \r (carriage return), \\r * (escaped carriage return), \t (tab), \\t (escaped tab), space, and , (comma). *
*/ inline EntityRecognizerInputDataConfig& AddEntityTypes(EntityTypesListItem&& value) { m_entityTypesHasBeenSet = true; m_entityTypes.push_back(std::move(value)); return *this; } /** *The S3 location of the folder that contains the training documents for your * custom entity recognizer.
This parameter is required if you set
* DataFormat
to COMPREHEND_CSV
.
The S3 location of the folder that contains the training documents for your * custom entity recognizer.
This parameter is required if you set
* DataFormat
to COMPREHEND_CSV
.
The S3 location of the folder that contains the training documents for your * custom entity recognizer.
This parameter is required if you set
* DataFormat
to COMPREHEND_CSV
.
The S3 location of the folder that contains the training documents for your * custom entity recognizer.
This parameter is required if you set
* DataFormat
to COMPREHEND_CSV
.
The S3 location of the folder that contains the training documents for your * custom entity recognizer.
This parameter is required if you set
* DataFormat
to COMPREHEND_CSV
.
The S3 location of the folder that contains the training documents for your * custom entity recognizer.
This parameter is required if you set
* DataFormat
to COMPREHEND_CSV
.
The S3 location of the CSV file that annotates your training documents.
*/ inline const EntityRecognizerAnnotations& GetAnnotations() const{ return m_annotations; } /** *The S3 location of the CSV file that annotates your training documents.
*/ inline bool AnnotationsHasBeenSet() const { return m_annotationsHasBeenSet; } /** *The S3 location of the CSV file that annotates your training documents.
*/ inline void SetAnnotations(const EntityRecognizerAnnotations& value) { m_annotationsHasBeenSet = true; m_annotations = value; } /** *The S3 location of the CSV file that annotates your training documents.
*/ inline void SetAnnotations(EntityRecognizerAnnotations&& value) { m_annotationsHasBeenSet = true; m_annotations = std::move(value); } /** *The S3 location of the CSV file that annotates your training documents.
*/ inline EntityRecognizerInputDataConfig& WithAnnotations(const EntityRecognizerAnnotations& value) { SetAnnotations(value); return *this;} /** *The S3 location of the CSV file that annotates your training documents.
*/ inline EntityRecognizerInputDataConfig& WithAnnotations(EntityRecognizerAnnotations&& value) { SetAnnotations(std::move(value)); return *this;} /** *The S3 location of the CSV file that has the entity list for your custom * entity recognizer.
*/ inline const EntityRecognizerEntityList& GetEntityList() const{ return m_entityList; } /** *The S3 location of the CSV file that has the entity list for your custom * entity recognizer.
*/ inline bool EntityListHasBeenSet() const { return m_entityListHasBeenSet; } /** *The S3 location of the CSV file that has the entity list for your custom * entity recognizer.
*/ inline void SetEntityList(const EntityRecognizerEntityList& value) { m_entityListHasBeenSet = true; m_entityList = value; } /** *The S3 location of the CSV file that has the entity list for your custom * entity recognizer.
*/ inline void SetEntityList(EntityRecognizerEntityList&& value) { m_entityListHasBeenSet = true; m_entityList = std::move(value); } /** *The S3 location of the CSV file that has the entity list for your custom * entity recognizer.
*/ inline EntityRecognizerInputDataConfig& WithEntityList(const EntityRecognizerEntityList& value) { SetEntityList(value); return *this;} /** *The S3 location of the CSV file that has the entity list for your custom * entity recognizer.
*/ inline EntityRecognizerInputDataConfig& WithEntityList(EntityRecognizerEntityList&& value) { SetEntityList(std::move(value)); return *this;} /** *A list of augmented manifest files that provide training data for your custom * model. An augmented manifest file is a labeled dataset that is produced by * Amazon SageMaker Ground Truth.
This parameter is required if you set
* DataFormat
to AUGMENTED_MANIFEST
.
A list of augmented manifest files that provide training data for your custom * model. An augmented manifest file is a labeled dataset that is produced by * Amazon SageMaker Ground Truth.
This parameter is required if you set
* DataFormat
to AUGMENTED_MANIFEST
.
A list of augmented manifest files that provide training data for your custom * model. An augmented manifest file is a labeled dataset that is produced by * Amazon SageMaker Ground Truth.
This parameter is required if you set
* DataFormat
to AUGMENTED_MANIFEST
.
A list of augmented manifest files that provide training data for your custom * model. An augmented manifest file is a labeled dataset that is produced by * Amazon SageMaker Ground Truth.
This parameter is required if you set
* DataFormat
to AUGMENTED_MANIFEST
.
A list of augmented manifest files that provide training data for your custom * model. An augmented manifest file is a labeled dataset that is produced by * Amazon SageMaker Ground Truth.
This parameter is required if you set
* DataFormat
to AUGMENTED_MANIFEST
.
A list of augmented manifest files that provide training data for your custom * model. An augmented manifest file is a labeled dataset that is produced by * Amazon SageMaker Ground Truth.
This parameter is required if you set
* DataFormat
to AUGMENTED_MANIFEST
.
A list of augmented manifest files that provide training data for your custom * model. An augmented manifest file is a labeled dataset that is produced by * Amazon SageMaker Ground Truth.
This parameter is required if you set
* DataFormat
to AUGMENTED_MANIFEST
.
A list of augmented manifest files that provide training data for your custom * model. An augmented manifest file is a labeled dataset that is produced by * Amazon SageMaker Ground Truth.
This parameter is required if you set
* DataFormat
to AUGMENTED_MANIFEST
.