Recognition Profiles Window - Enhanced OCR Zonal
Use this window to select settings for the Tungsten Enhanced OCR Zonal recognition profile.
Name
Use the list to select a recognition profile. The other settings on the window are refreshed with the settings defined for the selected profile.
Engine
Enhanced OCR Zonal is the default setting.
Languages
This box displays a single language or a list of languages separated by semicolons. The edit box can scroll to display all the selected languages. You cannot select languages here; the box is for informational purposes only.
Select button
Click this button to select languages, a character set, or text direction. The Recognition Languages window appears.
Use Code page to define a specific code page for the recognition engine. The default value is "Unicode". Otherwise, select "Current Operating System's code page" to apply the code page of the current operating system.
You can select more than one language by using the Add button. If you use Chinese, Japanese, Korean, or Arabic, adjust the Text Direction setting.
Mark and Spell
Use these settings to specify the minimum level of confidence to accept for character recognition. Characters that do not meet this minimum level are marked with the mark flag.
- General
If you select the General option, the adjacent list gives you a choice of three levels of confidence. The default level is Medium. The other choices are Low and High.
A setting of "Low" means that you accept a lower level of recognition confidence. There may be fewer mark flags in the results.
A setting of "Medium" means that you accept a moderate amount of recognition confidence. There may be more mark flags than with the "Low" setting.
A setting of "High" means that you require a greater degree of recognition confidence. There may be more mark flags than with the other settings.
- Specific
If you select the "Specific" option, you can specify a precise level of confidence ranging from 0 to 100.
The level value for "Low" is 75.
The level value for "Medium" is 85.
The level value for "High" is 95.
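The relationship between the General presets and a Specific value can be sketched in Python as follows. This is only an illustration of the thresholds described above, not the engine's implementation: the function names, the (character, confidence) input format, the "^" mark flag, and placing the flag in front of each marked character are all assumptions made for the example.

```python
from typing import List, Optional, Tuple

# Preset values as listed above; all names in this sketch are hypothetical.
PRESET_LEVELS = {"Low": 75, "Medium": 85, "High": 95}


def resolve_threshold(general: str = "Medium", specific: Optional[int] = None) -> int:
    """Return the minimum confidence: a Specific value (0-100) or a General preset."""
    if specific is not None:
        if not 0 <= specific <= 100:
            raise ValueError("Specific confidence must be between 0 and 100")
        return specific
    return PRESET_LEVELS[general]


def mark_low_confidence(chars: List[Tuple[str, int]], threshold: int,
                        mark_flag: str = "^") -> str:
    """Put the mark flag in front of every character below the threshold.

    The flag position is an assumption made for this illustration.
    """
    return "".join(
        (mark_flag + ch) if confidence < threshold else ch
        for ch, confidence in chars
    )


# Hypothetical per-character confidences for the text "A8C".
sample = [("A", 99), ("8", 80), ("C", 90)]
for level in ("Low", "Medium", "High"):
    print(level, mark_low_confidence(sample, resolve_threshold(level)))
```

With the same input, a higher threshold can only add mark flags, which matches the descriptions of the Low, Medium, and High settings.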
Spell check and Spell check flag
Select the "Spell check" option to have unrecognized words compared to entries in a dictionary of known values. If a word does not match, a spell check flag is inserted in front of it. A default dictionary is used, based on the currently selected language. You can also specify a custom dictionary by selecting a text file on the OCR tab of the Document Class Properties window.
Use the "Spell check flag" to specify a character to indicate words that are not found in a dictionary. You can specify only a single character. Note that the following occurs when you have enabled and specified a spell check flag:
- For a single language: All words are flagged when the selected language does not have a dictionary.
- For multiple languages: Only words not found in the dictionary are flagged. If the selected language does not have a dictionary, then the word is not flagged.
If a word is flagged twice, once with the spell check flag and once with the confidence mark flag, the spell check flag is first, followed immediately by the confidence mark flag. However, if the two flags are set to the same character (for example ^), both flags are represented by a single character. This is the default behavior.
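The ordering and merging of the two flags can be illustrated with a short Python sketch. It models only the rule described above (spell check flag first, mark flag second, one character when both flags are identical). The function name, the boolean inputs, and the "~" spell check flag are hypothetical and are not product defaults.

```python
def flag_word(word: str, in_dictionary: bool, low_confidence: bool,
              spell_flag: str = "~", mark_flag: str = "^") -> str:
    """Prefix a word with the spell check flag and/or the confidence mark flag.

    Hypothetical helper: the spell check flag comes first, and identical
    flag characters collapse into a single character.
    """
    prefix = ""
    if not in_dictionary:
        prefix += spell_flag
    if low_confidence:
        # Skip the mark flag only when the same character is already present.
        if not (prefix and mark_flag == spell_flag):
            prefix += mark_flag
    return prefix + word


print(flag_word("Invioce", in_dictionary=False, low_confidence=True))
print(flag_word("Invioce", in_dictionary=False, low_confidence=True,
                spell_flag="^", mark_flag="^"))
```

The first call uses two different flag characters, so both appear in front of the word; the second uses "^" for both, so the word carries a single "^".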
Character set
Use these settings to define the digit or character data filter.
A setting of "Any" means that the recognized data can contain any characters.
A setting of "Numbers only" means that the recognized data can contain only digits.
A setting of "Letters only" means that the recognized data can contain only letters.
A setting of "Custom" means that you can define the set of characters allowed in the recognition result. For example, if the recognition zone includes dates in the format DD/MM/YYYY, enter the digits and the "/" symbol in the custom filter input field. In this case, the engine recognizes the date correctly.
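As a rough illustration of what each filter permits, the following Python sketch checks a recognized string against the four settings. It is not how the engine applies the filter during recognition; the function names are hypothetical, and treating "letters" and "digits" as ASCII-only is an assumption made to keep the example short.

```python
import string
from typing import Optional, Set


def allowed_characters(setting: str, custom: str = "") -> Optional[Set[str]]:
    """Return the permitted characters for a setting, or None for no restriction."""
    if setting == "Any":
        return None
    if setting == "Numbers only":
        return set(string.digits)
    if setting == "Letters only":
        return set(string.ascii_letters)  # ASCII-only is an assumption
    if setting == "Custom":
        return set(custom)
    raise ValueError(f"Unknown character set setting: {setting}")


def conforms(text: str, setting: str, custom: str = "") -> bool:
    """Check whether every character of the text is permitted by the filter."""
    allowed = allowed_characters(setting, custom)
    return allowed is None or all(ch in allowed for ch in text)


# Custom filter for DD/MM/YYYY dates: the digits plus the "/" symbol.
date_filter = string.digits + "/"
print(conforms("31/12/2024", "Custom", date_filter))
print(conforms("31/12/2024", "Numbers only"))
```

The date passes the custom filter but not "Numbers only", because the "/" separator is not a digit.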
Advanced button
Click this button to specify advanced recognition settings. The Enhanced OCR Recognition Settings window appears.
Image Cleanup
Select an image cleanup profile from the list.
Edit button
To modify an existing image cleanup profile or create a new one, click the Edit button. The Image Cleanup Profiles window appears, and you can specify the type of image cleanup and other advanced settings.
Delete button
Click this button to delete the currently selected profile. It is not possible to delete profiles that are built in to Tungsten Capture.
Script button
If enabled, use this button to assign a recognition script to the selected profile. The Recognition Script window appears, and you can associate a recognition script with the recognition profile.
Test button
Click this button to test your zone settings. Your recognition and cleanup settings are applied to the zone, and the results are displayed in the Zone Test window.