Classification activity

This activity is used for document classification and document separation in your XBOUND process.

If you need to add this activity to the XBOUND Process Designer, the file to add is xBoundActivityDU.dll. (For instructions, see the XBOUND Help topic Adding an activity to the Process Designer.)

Note: This activity must be assigned to an Activities Service in order to run. (For instructions, see the XBOUND Help topic Assigning activities to the Activities Service.)

These settings are available when configuring the Classification activity:

Parameter Set

This drop-down list contains the solutions imported from Capture Components Administration. Solutions imported on the global and client levels are followed by [Global] and [Client] respectively. Solutions imported on the process level are displayed without an extension.

Select the parameter set (solution) you want to use for this activity.

Important: In a process that uses Knowledge Processing, all activities prior to the Knowledge Processing step must use the same Parameter Set.

Batch specification

This drop-down list contains all available batch specifications from the Capture Components solution that you imported to the XBOUND process. (For instructions, see Importing solutions from Capture Components Administration.)

Select the batch specification that contains the document specifications that you want to use this activity on.

Save character data

Select this option only when you need information about individual characters. For example, this is the case when you want to rubber-band parts of words, and not only entire words, during Verification.

This option creates a character object in the data model for each character. When this option is not selected, the characters are saved as a string in the corresponding Word object.

Warning: This option significantly increases the memory requirements during Interpretation and Verification.
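The difference between the two storage modes can be pictured with a minimal sketch. This is not the XBOUND data model: the Character and Word classes, their fields, and the build_word helper are hypothetical names used only for illustration. It assumes that each character object carries its own position data, which is why one object per character costs more memory than a single string per word.

```python
from dataclasses import dataclass, field
from typing import List, Tuple

# Hypothetical illustration only; these classes are not part of XBOUND.

@dataclass
class Character:
    # One object per recognized character, with its position on the page.
    value: str
    left: int
    top: int
    width: int
    height: int

@dataclass
class Word:
    # The word text stored as a plain string.
    text: str
    # Populated only when "Save character data" is selected (assumption).
    characters: List[Character] = field(default_factory=list)

Glyph = Tuple[str, int, int, int, int]  # (char, left, top, width, height)

def build_word(glyphs: List[Glyph], save_character_data: bool) -> Word:
    """Build a Word from recognized glyphs, optionally keeping per-character objects."""
    word = Word(text="".join(g[0] for g in glyphs))
    if save_character_data:
        word.characters = [Character(*g) for g in glyphs]
    return word
```

With save_character_data set to False, a word of N characters is held as one string; with True, the sketch additionally allocates N Character objects, which mirrors the increased memory requirement described in the warning above.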

Classification

Use document classification in this process step

Select this option to perform document classification in this activity. This option must be selected.

The documents are pre-separated. Use only document classification.

This option is selected by default. Use this option when the documents to be processed by this activity are already separated into sub-documents using another method.

Automatic document separation

Select this option to configure document separation strategies in this activity. At least one of the three methods below must then be selected.

Note: This feature is licensed separately. In the XBOUND License Manager, licenses for this feature are displayed as XBOUND Separation.

Use page classifiers to find document separations

Selected by default. Uses page classifiers defined on the first and last pages of a document specification to separate documents. Typically used to separate varied material, for example in a mailroom.

Require similarity between pages in the same document

Requires pages within a document to have similarities in their headers and logotypes, or in captured field data, or both.

Compare headers and logotypes

Separates documents by comparing layouts or logos. Assumes that all pages within a document have similar layouts and logos. Typically used to separate multi-page invoices.

Compare captured field data

Separates documents based on field data found on the pages. This is often used to separate multi-page invoices or customer orders.

Note on using this setting together with Use page classifiers to find document separations:

If you need to use these two settings together, all document types that you do NOT want to separate by the captured field value must have page classifiers specified, and must use either the Fixed number of pages page logic or both first and last page classifiers.
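As a rough illustration of separating on captured field data, the sketch below starts a new document whenever a captured value (for example, an invoice number) differs from the last value seen. The function name split_on_field_change and the field_value callback are hypothetical, and two behaviors are assumptions: that a change in value marks a document break, and that a page without a captured value stays with the current document. The product's actual separation logic is not described in this topic and may differ.

```python
from typing import Callable, List, Optional, TypeVar

Page = TypeVar("Page")

def split_on_field_change(
    pages: List[Page],
    field_value: Callable[[Page], Optional[str]],
) -> List[List[Page]]:
    """Group consecutive pages into documents; break when the captured value changes.

    Hypothetical sketch: pages with no captured value (None) are kept in the
    current document (assumption).
    """
    documents: List[List[Page]] = []
    last_value: Optional[str] = None
    for page in pages:
        value = field_value(page)
        value_changed = (
            value is not None and last_value is not None and value != last_value
        )
        if not documents or value_changed:
            documents.append([])  # start a new document
        documents[-1].append(page)
        if value is not None:
            last_value = value
    return documents
```

For example, four pages whose captured invoice numbers are INV-1, INV-1, none, and INV-2 would be grouped into one three-page document followed by one single-page document.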

Create a new document after each blank sheet

Creates a document break after every blank sheet. Requires manual insertion of blank sheets into the input stream. Typically used when the other options available are insufficient.
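The blank-sheet strategy amounts to a simple split of the page stream. The following is a minimal sketch, not the activity's implementation: split_on_blank_sheets and the is_blank callback are hypothetical names, and the sketch assumes that the blank separator sheets themselves are dropped from the resulting documents, which this topic does not state.

```python
from typing import Callable, List, TypeVar

Page = TypeVar("Page")

def split_on_blank_sheets(
    pages: List[Page],
    is_blank: Callable[[Page], bool],
) -> List[List[Page]]:
    """Split a page stream into documents, breaking after each blank sheet.

    Hypothetical sketch: blank sheets act purely as separators and are not
    kept in any document (assumption).
    """
    documents: List[List[Page]] = []
    current: List[Page] = []
    for page in pages:
        if is_blank(page):
            if current:  # close the document that precedes the blank sheet
                documents.append(current)
                current = []
        else:
            current.append(page)
    if current:  # pages after the last blank sheet form the final document
        documents.append(current)
    return documents
```

Because every break depends on a physically inserted sheet, a missing or misplaced blank sheet merges or splits documents incorrectly, which is one reason this method suits cases where the other options are insufficient.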

Default document specification

Select a default document specification for this activity. It is assigned to a document when no other specification can be assigned with confidence. When combined with the Only assign if no guess is available setting, it is assigned only when no other specification can be assigned at all. Optional.

Only assign if no guess is available

The default document specification is assigned to documents only when no guess is available at all. Optional.
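The interaction between the default document specification and the Only assign if no guess is available setting can be summarized as a small decision rule. The sketch below is an illustration of the behavior described above, not product code; default_applies and its parameters are hypothetical names, and it assumes a default document specification has been selected.

```python
def default_applies(has_guess: bool, confident: bool, only_if_no_guess: bool) -> bool:
    """Return True when the default document specification would be assigned.

    Hypothetical sketch of the rule described in this topic:
    - A confident classification always wins over the default.
    - Without "Only assign if no guess is available", the default covers
      every document that lacks a confident classification.
    - With that setting, the default covers only documents with no guess at all.
    """
    if has_guess and confident:
        return False
    if only_if_no_guess:
        return not has_guess
    return True
```

In other words, selecting Only assign if no guess is available leaves documents with an uncertain guess untouched by the default, so that the default is reserved for documents the classifier could not place at all.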

Performance counters

Use capture performance counters in this process step

Allows you to add counters for this process step to a Windows Performance Monitor session. These can be useful for troubleshooting performance problems. This is for advanced users. Use of the Windows Performance Monitor is beyond the scope of this help.

You can import or export settings from this activity using these buttons in the Parameter Set dialog:

Import button

Imports settings from an XML file that was previously created using Export.

Export button

Exports the settings to an XML file. Specify the file name and location. You can then import the XML file to get the same settings.

ReadSoft Capture Components activities: Overview