This activity is used to classify and extract documents with RecoStar Professional.
Note: This activity can only run once on any given server. Therefore, when you assign the activity to an XBOUND Activities Service (see XBOUND Help for instructions), the number of instances of this activity must be 0 or 1.
If for some reason you need to add this activity to the Process Designer, the file to add isxboundActOcfRecoStarProfExtract.dll.
These settings are available when configuring a process step of this activity type. (For information about configuring process steps, see XBOUND Help.)
The settings for classification and extraction are stored in a project file, which you create using RecoStar Design Studio. Specify the path to this file here. | |
Select to extract only those documents that have not been successfully validated yet. | |
Select to store the processing duration per document. | |
Select to acquire the defined field zone if no field value has been read. | |
Select to process multi-page documents. If selected, each page must be supplied as a separate medium. Fields extracted on the pages are transferred to XBOUND depending on the mapping configuration. The field names on all forms must be unique to prevent mutual overwriting. For multi-page documents, normally the RecoStar project contains a separate form for each page. When mapping RecoStar forms to XBOUND document types, only the determined form of the first page is considered. | |
On repeated extraction of a document, normally field values exist. By default, the new field values are appended to the existing ones. If this option is selected, the existing field values are deleted first. This only happens for fields where new field values are available. | |
Select to perform an additional full-page recognition. For this to be done, a new FullPageField "_FullPageField_" is added to the RecoStar project at runtime. The result will be acquired as an XBOUND engine result. | |
| This option allows for acquiring RecoStar's preprocessed (deskewed, despeckled, etc.) images as new XBOUND media. The geometries of the fields are adjusted in this case, and all extraction results are attached to the new image. The acquired medium is referenced by a new medium field "ReferenceToOriginalMedium" with the identifier of the original medium. |
Select whether to extract TIFF or JPEG images. | |
Select to extract only documents of certain document types. Then select all the document types that are to be extracted. If documents without types are to be extracted as well, select the option. | |
Select which type of RecoStar license you own and want to use for this project. | |
button | Opens the dialog, where you can configure how documents are to be classified and assign the interpretation results to XBOUND fields. |
These settings are also available:
button | Imports settings from an XML file that was previously created using . |
button | Exports the settings to an XML file. Specify the file name and location. You can then import the XML file to get the same settings. |
link | Opens a test form, where you can check a regular expression. |
ReadSoft Capture Framework activities: Overview
XBOUND activities: Overview (XBOUND Help topic)