Perform an extraction benchmark
Use the procedure in this topic to perform an extraction benchmark.
To generate extraction benchmarks, you need to open your golden files or a benchmark document set in the Documents window.
After running an extraction benchmark, you might want to change your extraction configuration settings. The following are common mistakes you want to avoid:
-
Lowering the minimum confidence thresholds just to get more green valid fields. You may end up missing invalid data as a result.
-
Increasing the minimum confidence thresholds just to eliminate red incorrect valid fields. This likely does not fix the problem that is causing the incorrect valid fields, only masks it. These types of errors may still occur in production.
-
Modifying recognition profile settings to improve a specific field. This may adversely affect other locator methods and fields.
Instead, you should concentrate on the following:
-
Confirm the recognition engine you are using is the most effective for your documents.
-
Improve your classification results so they do not negatively affect your extraction results.
-
Improve the regular expressions used by a Format Locator.
-
Review and update zone settings such as image cleanup, anchoring and registration for an Advanced Zone Locator.
-
Review and improve any scripts used at both project and class level.
-
Verify that all databases are up-to-date and that your queries return the correct data.
-
Review and improve Table Locator settings for regular and trainable tables.
-
Update any other locator-specific settings that could improve extraction results.
Procedure
- Open the Documents window if it is not already open.
-
If a different view is in use, switch to the
List View
.
- Open the golden files you created for extraction testing.
-
On the
Process tab, in the
Benchmark group, select
Extraction Benchmark
and select one of the following settings from the submenu:
-
Extraction Benchmark (Selected Class)
-
Extraction Benchmark (Selected Class And Children)
-
Extraction Benchmark (All Classes)
The Extraction Benchmark window is displayed and the selected extraction benchmark runs.
-
-
If you made changes outside of this window, you can run another benchmark by selecting
Start.
The extraction benchmark results are displayed in the Summary and Details tables.
- Optional. Select Save to store the benchmark results for comparison later.
- Optional. Export your benchmark results as a .csv file that can be opened in another application.