Train the Trainable Evaluator
By training this evaluator, you are teaching your project to extract the best results from input locators, even if those locators are not trainable themselves. For example, you can use a Format Locator to extract a customer number on a document. The Format Locator itself is not trainable.
The Trainable Evaluator has subfields for input locators. All of the alternatives for the input locators are tested, and the alternative with the highest confidence based on existing training documents is returned. For the best results, add five or more ideal training documents for each piece of information that you are training. If a single document contains data for multiple subfields, you do not need one for each individual subfield.
You can train the Trainable Evaluator by following these steps:
- Open the Project Tree window if it is not already open.
- Expand the Project Tree and select the class.
-
Optionally,
view the class contents if they are not
already displayed.
The hidden class contents are displayed.
- Open the Documents window if it is not already open.
- If a different view is in use, switch to the List view .
-
Add or open a document set that contains the
documents to use to train the
Trainable Evaluator.
A list of documents is displayed.
-
Double-click a document.
The Document Viewer is displayed showing the selected document.
-
If the document is suitable for training, select the document in
the
list view of the
Documents window and select
Train for Extraction
from the menu.
The Edit Document window is displayed showing the selected document.
-
In the
Edit Document window,
select a field and lasso the corresponding content in the document.
The lassoed data is entered into the field.
-
Repeat lassoing for each field and when finished, click
Add to Training Folder
.
The document is added to your Extraction Set and the next document in your document set is loaded automatically.
Edit Document window automatically, once you have processed the first document. If you do not wish to add more documents to the extraction training set for this class, close the Edit Document window.If there are additional documents in the document set, these are loaded in the -
Continue to add training documents until you are ready to test
your training set, and
Close the
Edit Document window.
-
On the
Process Ribbon tab, in
the
Train group, click
Extraction
.
The documents in your training set are trained and a progress bar lets you view the progress.
-
In your test document set, right-click one or more selected
documents, and click
Process.
The document is classified and extracted.
-
Open the
Extraction Results window if it is
not already open.
The extraction results are displayed. Invalid fields have a blue question mark and valid fields have a green check mark.
-
In the
Extraction Results
window, view the
Trainable Evaluator results based on your
training documents.
If the results are not satisfactory, add training documents to your Extraction Set by repeating steps 7 through 14.