ABBYY FlexiCapture Extraction activity

This activity is used to classify and extract documents with ABBYY FlexiCapture.

If for some reason you need to add this activity to the Process Designer, the file to add isxboundActOcfAbbyyFlexiExtract.dll.

These settings are available when configuring a process step of this activity type. (For information about configuring process steps, see XBOUND Help.)

Project file

The settings for classification and extraction are stored in a project file, which you create using the ABBYY FlexiCapture Administrator Station or Operator Station. Specify the path to this file here.

Process unprocessed or invalid documents only

Select to extract only those documents that have not been successfully validated yet.

Save classify/extraction duration into sub-documents

Select to store the processing duration per document.

Check if number of images is same as in document definition (only for classification option None)

Select to check whether each document contains the same number of images as specified in the ABBYY document definition.

If you use document definitions with a variable number of images, do not select this option.

Process images in the memory

 

Select if images should be processed in the memory (images will not be written to the file system).

Note: Activate this option only if the application server has enough memory and the root documents are not too big.

Include section names in field names

Select to include section names in field names. (ClosedExample.) Activate this option only if the document definitions contain several sections with the same field names.

  • Without the full field path: Document definition.field name

  • With the full field path: Document definition.section.field name

Warning: If you select this option in an already existing parameter set, all mapping information will be lost.

If this option is not selected and the document definition contains several sections with the same field names, the value of the last field will be written to the XBOUND field. (ClosedExample.)

A document definition has two sections with an Age field:

  • Document definition.section1.Age: extracted value = 22

  • Document definition.section2.Age: extracted value = 33

Then the value 33 will be written to the XBOUND field.

Process fields without media relation

Select if fields without a media assignment (such as service fields or calculated fields) are to be processed. These fields will be assigned to the first medium of the document.

Image selection

Select whether to extract TIFF or JPEG images.

Apply to following document types

Select to extract only documents of certain document types. Then select all the document types that are to be extracted. If documents without types are to be extracted as well, select the No document type option.

OCR mapping button

Opens the OCR mapping dialog, where you can configure how documents are to be classified and assign the interpretation results to XBOUND fields.

These settings are also available:

Import button

Imports settings from an XML file that was previously created using Export.

Export button

Exports the settings to an XML file. Specify the file name and location. You can then import the XML file to get the same settings.

Check regular expression link

Opens a test form, where you can check a regular expression.

ReadSoft Capture Framework activities: Overview

XBOUND activities: Overview (XBOUND Help topic)