Multidimensional Classification plugin
Multidimensional Classification is a plugin within the Page Process module. This patented plugin classifies documents across various dimensions and combines the score of each dimension. This results in improved accuracy and confidence in document learning.
For the following reasons, Multidimensional Classification is a better approach to classifying documents, as compared to the older Search Classification method:
-
Search Classification was the original method of page classification with Transact. Although this older method continues to work well, it is a less accurate approach.
-
Multidimensional Classification contains newer algorithms that perform classification more accurately. This feature also improves document assembly.
Configure the Multidimensional Classification plugin
This topic describes how to configure, enable, or disable the MULTIDIMENSIONAL_CLASSIFICATION_PLUGIN in the Page Process workflow module and in a batch class. Ensure that you have administrator rights to perform these tasks. These configurations are made from the Batch Class Management.
Note that following:
-
This plugin is included by default in the BC1 batch class.
-
This plugin must be added manually to a batch class that is not based on the BC1 template batch class.
-
The default mode of this plugin is OFF.
You have the option of leaving both plugins in the module if you disable the SEARCH_CLASSIFICATION plugin.
If you plan to use the Multidimensional Classification plugin to create classification results from document learning and to perform document assembly, we recommend disabling or removing the SEARCH_CLASSIFICATION plugin. See Remove or disable the Search Classification plugin.
Do the following to configure the Multidimensional Classification plugin:
- Launch Transact. Select . Enter login credentials when prompted.
- Select an existing batch class and click Open or create a new batch class. You can also copy or import an existing batch class, then modify it to create a new batch class.
-
In the batch class, select
.
The Associated Plugins and Selected Plugins sections appear in the Plugin Configuration screen.
-
Add the plugin to the
Page Process module:
-
Select the
MULTIDIMENSIONAL_CLASSIFICATION_PLUGIN from the
Associated Plugins section of the Plugin Configuration screen and click the right arrow.
This moves the plugin to the Selected Plugins section.
- Use the up and down arrows to rearrange the position of this plugin in relation to the other plugins of this module as needed.
-
Click
Deploy to activate the plugin, save the changes made in step
4.b, and click
Close to complete the configuration.
This message appears when the update is completed: Batch Class updated successfully.
This plugin is included in the Page Process module and is ready to be activated.
-
Select the
MULTIDIMENSIONAL_CLASSIFICATION_PLUGIN from the
Associated Plugins section of the Plugin Configuration screen and click the right arrow.
-
Enable the
MULTIDIMENSIONAL_CLASSIFICATION_PLUGIN for the batch class by selecting this plugin within the
Page Process module.
The Plugin Configuration screen appears on the right.
- Review or configure this plugin by opening a batch class from the Batch Class Management screen.
-
Enable the plugin by setting the
Multidimensional Classification Switch to
ON.
The Multidimensional Classification Switch contains the following two options in the fields:
-
ON: Enables this plugin. When this plugin is enabled, Transact uses this plugin to classify documents.
-
OFF: Disables this plugin. When this plugin is disabled, Transact does not use this plugin to classify documents.
-
-
Set the
Multidimensional Classification Max Results to the required number.
The Multidimensional Classification Max Results field controls how many alternate value elements are generated in the batch.xml file produced within the workflow. The default setting for max results is 5 to keep the overall size of the batch.xml smaller. Adjust this setting as required. The size of the batch.xml can be increased or decreased based on your setting.
- Click Deploy to enable the changes.
- Click Close to exit the Plugin Configuration screen.
Once configured, the Transact administrator must train at least one document for each document type with the Multidimensional Classification plugin. See Document learning with Multidimensional Classification.
See Remove or disable the Search Classification plugin after the Multidimensional Classification configuration.
Set the classification type for the Document Assembler plugin
This topic describes how to select Multidimensional Classification as the classification type in the DOCUMENT_ASSEMBLER plugin in the Document Assembly module. The MULTIDIMENSIONAL_CLASSIFICATION_PLUGIN must first be configured and enabled.
Before you proceed, verify that you complete steps in Configure the Multidimensional Classification plugin.
Do the following:
- Launch Transact and select . Enter login credentials when prompted.
- Open the batch class where the MULTIDIMENSIONAL_CLASSIFICATION_PLUGIN is enabled. Select the batch class and click Open.
-
Go to the
Document Assembly module and select the
DOCUMENT_ASSEMBLER plugin.
The Plugin Configuration screen for DOCUMENT_ASSEMBLER plugin displays.
-
Select
MultidimensionalClassification from the
DA Classification Type field.
The confirmation windows appear when you click both Apply and Deploy.
- Click Close to return to the Batch Class Management screen.
Remove or disable the Search Classification plugin
If you configure and enable the Multidimensional Classification plugin, disable or remove the Search Classification plugin. The user must have Administrator rights to perform this task.
If you have any scripts that rely on results from the Search Classification plugin, leave them in the module.
Perform these steps to add, enable, disable, or remove the SEARCH_CLASSIFICATION plugin in the Page Process module for the batch class that uses MULTIDIMENSIONAL_CLASSIFICATION_PLUGIN.
- Launch Transact and select . Enter login credentials when prompted.
- Either select an existing batch class and click Open, or create a new batch class.
-
Go to the
Page Process module.
The Associated Plugins and Selected Plugins for the Page Process module appear on the right.
-
Perform one of the following steps to remove or disable the
Search Classification plugin:
- Remove the SEARCH_CLASSIFICATION plugin by selecting it from the Selected Plugins field on the right. Use the left arrow button to move this plugin to the Associated Plugins field on the left.
- Disable the SEARCH_CLASSIFICATION plugin by selecting the plugin and selecting OFF from the Plugin Configuration screen.
- Click Deploy to activate this change.
-
Retrain
Transact
with
Learn Files for the document type, as applicable.
When you disable the Search Classification plugin, you must retrain the batch class with the documents using the Multidimensional Classification plugin, which generates its own model for classifying the data.
-
Use the Learn Files button in the Batch Class Management screen.
-
Continue to Document learning with Multidimensional Classification.
-
Document learning with Multidimensional Classification
The standard method of training a batch class for a document type applies to both Search Classification and Multidimensional classification.
The Multidimensional Classification mechanism works on supervised learning.
-
The plugin for Search Classification or Multidimensional Classification uses learning to classify the pages in the given batch. Such learning includes search classification.
-
The MULTIDIMENSIONAL_CLASSIFICATION_PLUGIN takes a sample of search classification and updates it during the learning process.
Within an open batch class, use the Learn File(s) button in the Batch Class Management screen. When a document type is trained with at least one file, the plugin creates a new file with the following name in the Batch Class folder:
BC<ID>-dimensions
Example: BC8-dimensions
Click Learn File(s) to update the learning that occurs during classification.
Multidimensional classification and machine learning
The MULTIDIMENSIONAL_CLASSIFICATION_PLUGIN feature is supported by machine learning for document types. In this case, if classification is done incorrectly during the initial process job, the plugin can learn the change that is required to correct the classification and ensure it performs classification correctly with verifying batch processing jobs.
Test classification of a document type with Multidimensional Classification
This topic describes how to learn a new document to test for classification of a new document type. Before you proceed, verify that you complete all the configurations for the Multidimensional Classification, Search Classification, and Document Assembler plugins. See the following topics for additional information about configuring these two plugins:
Also, to complete this procedure, you must have sample documents available in electronic format, PDF or TIFF, for each document type that must be tested.
Perform the following steps to test classification of a document type, for the batch class with the Multidimensional Classification plugin enabled:
- Launch Transact and select . Enter login credentials when prompted.
-
Open the batch class in which the
Multidimensional Classification plugin is enabled. Select the batch class, then click
Open.
The batch class opens with a list of document types.
-
To learn samples for a document type, upload sample documents.
- For each document type, click Learn Files.
- Select the document type and click the Upload Test Classification File(s) link to select and upload a test image file for the document type.
You can also drag and drop the sample image file to the Drag and Drop Files Here area below the Upload Test Classification File(s) link.
The following message displays and notifies you when the test file upload is complete, File(s) Uploaded Successfully For Classification.
-
Navigate to the document type screen where you uploaded the test image file (in the previous step) and click
Test Classification from the toolbar on top of the screen.
The Test Classification screen appears.
-
Select an option from the
Workflow field.
Option Description ON If the Workflow field is set to ON, the Classification Types field is disabled. Test classification results are based on configurations within the batch class.
OFF If the Workflow field is set to OFF, the test classification results are based on the selection that you make from the Classification Types field. The types include:
-
Search Classification
-
Barcode Classification
-
Image Classification
-
Automatic Classification
-
Keyword Classification
-
Multidimensional Classification
-
-
Click
Classify.
The Test Classification screen is updated with the classification results.
Additional information about Multidimensional Classification
Learning with Multidimensional Classification
Multidimensional classification can be described as page classification learning combined with search classification learning. This classification mechanism works on supervised learning.
The user begins this process by importing documents then moving to the Page Process module and clicking Learn Files to update learning for page and search classification.
The plugin uses learning to classify the pages in the given batch, and simultaneously learns search classification. The plugin takes the sample of search classification and makes corresponding updates to the page-classification learning.
Phases of Multidimensional Classification
Click Learn Files to begin the learning phase. A single model file is kept per batch class. In the learning phase for a document, the system performs calculations and saves the model file. Each document page type (first page, middle page, and last page) saves multiple types of information.
During the batch execution phase, the pages in the batch are classified according to learning.
When performing auto-learning of the document type and indexes for search classification, Transact also updates the model file for multidimensional classification.
During the document assembly phase, this plugin works in the same way as search classification.
More information
For additional information about configuring or using classification in Ephesoft Transact, see the following topics:
For additional information about batch class creation, setup, and configuration, see the following topic: