Copyright | (c) 2013-2021 Brendan Hay |
---|---|
License | Mozilla Public License, v. 2.0. |
Maintainer | Brendan Hay <brendan.g.hay+amazonka@gmail.com> |
Stability | auto-generated |
Portability | non-portable (GHC extensions) |
Safe Haskell | None |
Synopsis
- data InputDataConfig = InputDataConfig' {}
- newInputDataConfig :: Text -> InputDataConfig
- inputDataConfig_documentReaderConfig :: Lens' InputDataConfig (Maybe DocumentReaderConfig)
- inputDataConfig_inputFormat :: Lens' InputDataConfig (Maybe InputFormat)
- inputDataConfig_s3Uri :: Lens' InputDataConfig Text
Documentation
data InputDataConfig Source #
The input properties for an inference job.
See: newInputDataConfig
smart constructor.
InputDataConfig' | |
|
Instances
Create a value of InputDataConfig
with all optional fields omitted.
Use generic-lens or optics to modify other optional fields.
The following record fields are available, with the corresponding lenses provided for backwards compatibility:
$sel:documentReaderConfig:InputDataConfig'
, inputDataConfig_documentReaderConfig
- The document reader config field applies only for InputDataConfig of
StartEntitiesDetectionJob.
Use DocumentReaderConfig to provide specifications about how you want your inference documents read. Currently it applies for PDF documents in StartEntitiesDetectionJob custom inference.
$sel:inputFormat:InputDataConfig'
, inputDataConfig_inputFormat
- Specifies how the text in an input file should be processed:
ONE_DOC_PER_FILE
- Each file is considered a separate document. Use this option when you are processing large documents, such as newspaper articles or scientific papers.ONE_DOC_PER_LINE
- Each line in a file is considered a separate document. Use this option when you are processing many short documents, such as text messages.
$sel:s3Uri:InputDataConfig'
, inputDataConfig_s3Uri
- The Amazon S3 URI for the input data. The URI must be in same region as
the API endpoint that you are calling. The URI can point to a single
input file or it can provide the prefix for a collection of data files.
For example, if you use the URI S3://bucketName/prefix
, if the
prefix is a single file, Amazon Comprehend uses that file as input. If
more than one file begins with the prefix, Amazon Comprehend uses all of
them as input.
inputDataConfig_documentReaderConfig :: Lens' InputDataConfig (Maybe DocumentReaderConfig) Source #
The document reader config field applies only for InputDataConfig of StartEntitiesDetectionJob.
Use DocumentReaderConfig to provide specifications about how you want your inference documents read. Currently it applies for PDF documents in StartEntitiesDetectionJob custom inference.
inputDataConfig_inputFormat :: Lens' InputDataConfig (Maybe InputFormat) Source #
Specifies how the text in an input file should be processed:
ONE_DOC_PER_FILE
- Each file is considered a separate document. Use this option when you are processing large documents, such as newspaper articles or scientific papers.ONE_DOC_PER_LINE
- Each line in a file is considered a separate document. Use this option when you are processing many short documents, such as text messages.
inputDataConfig_s3Uri :: Lens' InputDataConfig Text Source #
The Amazon S3 URI for the input data. The URI must be in same region as the API endpoint that you are calling. The URI can point to a single input file or it can provide the prefix for a collection of data files.
For example, if you use the URI S3://bucketName/prefix
, if the
prefix is a single file, Amazon Comprehend uses that file as input. If
more than one file begins with the prefix, Amazon Comprehend uses all of
them as input.