startDocumentAnalysis method
- required DocumentLocation documentLocation,
- required List<FeatureType> featureTypes,
- String? clientRequestToken,
- String? jobTag,
- String? kMSKeyId,
- NotificationChannel? notificationChannel,
- OutputConfig? outputConfig,
Starts the asynchronous analysis of an input document for relationships between detected items such as key-value pairs, tables, and selection elements.
StartDocumentAnalysis can analyze text in documents that are in JPEG, PNG,
and PDF format. The documents are stored in an Amazon S3 bucket. Use
DocumentLocation to specify the bucket name and file name of the document.
StartDocumentAnalysis returns a job identifier (JobId) that you use to get
the results of the operation. When text analysis is finished, Amazon
Textract publishes a completion status to the Amazon Simple Notification
Service (Amazon SNS) topic that you specify in NotificationChannel. To get
the results of the text analysis operation, first check that the status
value published to the Amazon SNS topic is SUCCEEDED. If so, call
GetDocumentAnalysis, and pass the job identifier (JobId) from the initial
call to StartDocumentAnalysis.
For more information, see Document Text Analysis.
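The check described above (confirm the published status is SUCCEEDED, then use the JobId) can be sketched as follows. This is a minimal illustration, not part of the library: the helper name succeededJobId is hypothetical, and it assumes the notification body is JSON text carrying string fields named JobId and Status.

```dart
import 'dart:convert';

/// Hypothetical helper: returns the JobId from a completion notification
/// when its status is SUCCEEDED, or null otherwise.
///
/// Assumption: [message] is the JSON text of the notification and carries
/// string fields named "JobId" and "Status". Malformed JSON makes
/// jsonDecode throw a FormatException, which is left to the caller.
String? succeededJobId(String message) {
  final decoded = jsonDecode(message);
  if (decoded is! Map<String, dynamic>) return null;
  final status = decoded['Status'];
  final jobId = decoded['JobId'];
  if (status == 'SUCCEEDED' && jobId is String) return jobId;
  return null;
}

void main() {
  // Sample message made up for illustration.
  const sample = '{"JobId":"abc123","Status":"SUCCEEDED","JobTag":"receipt"}';
  final jobId = succeededJobId(sample);
  if (jobId != null) {
    // A real client would now call GetDocumentAnalysis with this JobId.
    print('Ready to fetch results for job $jobId');
  }
}
```

Returning null for any status other than SUCCEEDED keeps the caller from requesting results for a job that failed or is still running.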
May throw InvalidParameterException. May throw InvalidS3ObjectException. May throw InvalidKMSKeyException. May throw UnsupportedDocumentException. May throw DocumentTooLargeException. May throw BadDocumentException. May throw AccessDeniedException. May throw ProvisionedThroughputExceededException. May throw InternalServerError. May throw IdempotentParameterMismatchException. May throw ThrottlingException. May throw LimitExceededException.
Parameter documentLocation:
The location of the document to be processed.
Parameter featureTypes:
A list of the types of analysis to perform. Add TABLES to the list to
return information about the tables that are detected in the input
document. Add FORMS to return detected form data. To perform both types of
analysis, add TABLES and FORMS to FeatureTypes. All lines and words
detected in the document are included in the response (including text that
isn't related to the value of FeatureTypes).
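As a rough illustration of how a list of feature types becomes the TABLES and FORMS strings sent in the request, the sketch below uses a local stand-in enum. The enum, the extension, and encodeFeatureTypes are defined here for the example only; the library's own FeatureType may have more values.

```dart
// Local stand-in for the library's FeatureType enum; only the two values
// named in the text above are modelled.
enum FeatureType { tables, forms }

extension FeatureTypeWire on FeatureType {
  /// Wire value for the request: TABLES or FORMS.
  String toValue() {
    switch (this) {
      case FeatureType.tables:
        return 'TABLES';
      case FeatureType.forms:
        return 'FORMS';
    }
  }
}

/// Same mapping step as in the implementation: each enum value is replaced
/// by its wire string.
List<String> encodeFeatureTypes(List<FeatureType> featureTypes) =>
    featureTypes.map((e) => e.toValue()).toList();

void main() {
  // Request both kinds of analysis.
  print(encodeFeatureTypes([FeatureType.tables, FeatureType.forms]));
  // prints [TABLES, FORMS]
}
```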
Parameter clientRequestToken:
The idempotent token that you use to identify the start request. If you
use the same token with multiple StartDocumentAnalysis requests, the same
JobId is returned. Use ClientRequestToken to prevent the same job from
being accidentally started more than once. For more information, see
Calling Amazon Textract Asynchronous Operations.
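One way to benefit from the token is to reuse it on every retry of the same logical request, so that a retry after a lost response maps back to the same job. The sketch below is an assumption-laden illustration: startWithRetry and fakeStart are hypothetical, and the start callback stands in for a call to startDocumentAnalysis that returns the JobId.

```dart
/// Hypothetical retry wrapper: calls [start] with the SAME [token] on every
/// attempt, so repeated requests refer to one job rather than starting a
/// new job each time.
Future<String> startWithRetry(
  Future<String> Function(String token) start,
  String token, {
  int maxAttempts = 3,
}) async {
  Object? lastError;
  for (var attempt = 1; attempt <= maxAttempts; attempt++) {
    try {
      return await start(token);
    } catch (e) {
      // Simplification: every error is retried. Real code would retry only
      // transient failures such as throttling.
      lastError = e;
    }
  }
  throw StateError('start failed after $maxAttempts attempts: $lastError');
}

Future<void> main() async {
  final seen = <String>[];
  var calls = 0;

  // Fake start function: fails once, then returns a job id for the token.
  Future<String> fakeStart(String token) async {
    seen.add(token);
    calls++;
    if (calls == 1) throw Exception('simulated timeout');
    return 'job-for-$token';
  }

  final jobId = await startWithRetry(fakeStart, 'invoice-2024-0001');
  print(jobId); // prints job-for-invoice-2024-0001
  print(seen); // prints [invoice-2024-0001, invoice-2024-0001]
}
```

The token is chosen once, outside the retry loop; generating a fresh token per attempt would defeat the idempotency guarantee.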
Parameter jobTag:
An identifier that you specify that's included in the completion
notification published to the Amazon SNS topic. For example, you can use
JobTag to identify the type of document that the completion notification
corresponds to (such as a tax form or a receipt).
Parameter kMSKeyId:
The KMS key used to encrypt the inference results. This can be in either
Key ID or Key Alias format. When a KMS key is provided, it is used for
server-side encryption of the objects in the customer bucket. When this
parameter is not set, the results are encrypted server side using SSE-S3.
Parameter notificationChannel:
The Amazon SNS topic ARN that you want Amazon Textract to publish the
completion status of the operation to.
Parameter outputConfig:
Specifies whether the output goes to a customer-defined bucket. By
default, Amazon Textract saves the results internally, to be accessed by
the GetDocumentAnalysis operation.
Implementation
Future<StartDocumentAnalysisResponse> startDocumentAnalysis({
  required DocumentLocation documentLocation,
  required List<FeatureType> featureTypes,
  String? clientRequestToken,
  String? jobTag,
  String? kMSKeyId,
  NotificationChannel? notificationChannel,
  OutputConfig? outputConfig,
}) async {
  ArgumentError.checkNotNull(documentLocation, 'documentLocation');
  ArgumentError.checkNotNull(featureTypes, 'featureTypes');
  _s.validateStringLength(
    'clientRequestToken',
    clientRequestToken,
    1,
    64,
  );
  _s.validateStringLength(
    'jobTag',
    jobTag,
    1,
    64,
  );
  _s.validateStringLength(
    'kMSKeyId',
    kMSKeyId,
    1,
    2048,
  );
  final headers = <String, String>{
    'Content-Type': 'application/x-amz-json-1.1',
    'X-Amz-Target': 'Textract.StartDocumentAnalysis'
  };
  final jsonResponse = await _protocol.send(
    method: 'POST',
    requestUri: '/',
    exceptionFnMap: _exceptionFns,
    // TODO queryParams
    headers: headers,
    payload: {
      'DocumentLocation': documentLocation,
      'FeatureTypes': featureTypes.map((e) => e.toValue()).toList(),
      if (clientRequestToken != null)
        'ClientRequestToken': clientRequestToken,
      if (jobTag != null) 'JobTag': jobTag,
      if (kMSKeyId != null) 'KMSKeyId': kMSKeyId,
      if (notificationChannel != null)
        'NotificationChannel': notificationChannel,
      if (outputConfig != null) 'OutputConfig': outputConfig,
    },
  );
  return StartDocumentAnalysisResponse.fromJson(jsonResponse.body);
}
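The implementation does two things before sending: it checks the length of the optional strings, and it builds the payload so that optional keys appear only when a value was given. The standalone sketch below imitates both steps. It is not the library's code: checkLength and buildPayload are hypothetical helpers, the exact exception type thrown by the library's validator is not shown in the source, and the nested S3Object/Bucket/Name layout of DocumentLocation is an assumption about the request's JSON shape.

```dart
import 'dart:convert';

/// Hypothetical stand-in for the length checks in the implementation:
/// null values are skipped, anything else must have a length within
/// [min]..[max]. The error type is an assumption.
void checkLength(String name, String? value, int min, int max) {
  if (value == null) return;
  if (value.length < min || value.length > max) {
    throw ArgumentError.value(
        value, name, 'length must be between $min and $max');
  }
}

/// Builds a request payload in the same style as the implementation:
/// required keys are always present, optional keys only when non-null.
Map<String, dynamic> buildPayload({
  required String bucket,
  required String name,
  required List<String> featureTypes,
  String? clientRequestToken,
  String? jobTag,
  String? kmsKeyId,
}) {
  // Same bounds as in the implementation above.
  checkLength('clientRequestToken', clientRequestToken, 1, 64);
  checkLength('jobTag', jobTag, 1, 64);
  checkLength('kMSKeyId', kmsKeyId, 1, 2048);
  return {
    // Assumed JSON layout for the document location.
    'DocumentLocation': {
      'S3Object': {'Bucket': bucket, 'Name': name},
    },
    'FeatureTypes': featureTypes,
    if (clientRequestToken != null) 'ClientRequestToken': clientRequestToken,
    if (jobTag != null) 'JobTag': jobTag,
    if (kmsKeyId != null) 'KMSKeyId': kmsKeyId,
  };
}

void main() {
  final payload = buildPayload(
    bucket: 'my-bucket', // placeholder bucket name
    name: 'form.pdf', // placeholder object name
    featureTypes: ['TABLES', 'FORMS'],
    jobTag: 'receipt',
  );
  // ClientRequestToken and KMSKeyId are absent because they were not given.
  print(jsonEncode(payload));
}
```

Omitting absent keys, rather than sending them as null, keeps the request body limited to the fields the caller actually set.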