Recognition Profiles Window - Kofax PDF Text Under Image
Use this window to select settings for the Kofax PDF Text Under Image recognition profile.
Name
Use the list to select a recognition profile. The other settings in the window are refreshed to show the values defined for the selected profile.
Engine
This is preset to PDF Image + Text.
Languages
This contains a single language, or a list of multiple languages separated by semicolons. The edit box can scroll to display all the selected languages. You cannot select languages here; this box is for information only.
Select button
Click this button to select languages from the Recognition Languages window.
Mark level
Use these settings to specify the minimum level of confidence to accept for character recognition. Characters that do not meet this minimum level are identified with the mark flag.
- General
Use this setting to select from three levels of confidence: Low, Medium, and High. The default level is Medium.
A setting of Low accepts a lower level of recognition confidence, which results in fewer mark flags.
A setting of Medium requires a moderate level of recognition confidence, which results in more mark flags than the Low setting.
A setting of High requires a greater degree of recognition confidence, which may result in more mark flags than the other levels.
- Specific
Use this setting to define a precise level of confidence ranging from 0 to 100.
The level value for Low is 75.
The level value for Medium is 85.
The level value for High is 95.
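The relationship between the General and Specific settings can be sketched in a few lines of Python. This is only an illustration, not the recognition engine's code: the names GENERAL_LEVELS, resolve_threshold, and marked_positions are hypothetical, and the sketch assumes that each character has a confidence value from 0 to 100 and that "does not meet the minimum" means strictly below it.

```python
# Illustrative sketch only; these names are hypothetical and not part of Kofax Capture.
# Level values are the ones listed above for the General setting.
GENERAL_LEVELS = {"Low": 75, "Medium": 85, "High": 95}


def resolve_threshold(general="Medium", specific=None):
    """Return the minimum confidence (0 to 100) implied by the settings."""
    if specific is not None:
        if not 0 <= specific <= 100:
            raise ValueError("Specific level must be between 0 and 100")
        return specific
    return GENERAL_LEVELS[general]


def marked_positions(confidences, threshold):
    """Return indexes of characters whose confidence is below the threshold.

    Assumption: a character is marked when its confidence is strictly
    lower than the minimum level.
    """
    return [i for i, value in enumerate(confidences) if value < threshold]


# Made-up per-character confidence values for a four-character word.
sample = [99, 80, 90, 70]
for level in ("Low", "Medium", "High"):
    print(level, marked_positions(sample, resolve_threshold(level)))
```

With these made-up values, raising the level from Low to High marks more characters, which matches the descriptions of the three General levels.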
Spell check and Spell check flag
Select the Spell check option to have unrecognized words compared to entries in a dictionary of known values. If they do not match, a spell check flag is inserted in front of the non-matching word. A default dictionary corresponding to the currently selected language is used. You can also specify a custom dictionary by selecting a text file on the OCR tab of the Document Class Properties window.
Use the Spell check flag setting to specify a character that indicates words not found in a dictionary. You can specify only a single character. Note that the following occurs when you specify a spell check flag:
- For a single language: all words are flagged when the selected language does not have a dictionary.
- For multiple languages: only words not found in the dictionary are flagged. If the selected language does not have a dictionary, then the word is not flagged.
If a word is flagged twice, once with the spell check flag and once with the confidence mark flag, the spell check flag is first, followed immediately by the confidence mark flag. However, if the two flags are set to the same character (for example, ^), both flags are represented by a single character. This is the default behavior.
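The flag ordering described above can be illustrated with a short Python sketch. The function flag_sequence is hypothetical and is not part of Kofax Capture. It models only which flag characters appear for a word and in what order, and it assumes the caller already knows whether the word failed the spell check and whether it fell below the mark level.

```python
# Illustrative sketch only; flag_sequence is a hypothetical helper.
def flag_sequence(spell_flagged, mark_flagged, spell_flag, mark_flag):
    """Return the flag characters for one word, in the order described above."""
    if len(spell_flag) != 1:
        raise ValueError("The spell check flag must be a single character")
    if spell_flagged and mark_flagged:
        if spell_flag == mark_flag:
            # Identical flags are represented by a single character.
            return spell_flag
        # Spell check flag first, followed immediately by the mark flag.
        return spell_flag + mark_flag
    if spell_flagged:
        return spell_flag
    if mark_flagged:
        return mark_flag
    return ""


print(flag_sequence(True, True, "~", "^"))  # two different flags
print(flag_sequence(True, True, "^", "^"))  # identical flags
```

The first call returns both characters with the spell check flag first; the second returns a single character because the two flags are the same.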
Non-natural language
Use the text entry field below the check box to specify the characters that are valid to include in a word.
Advanced button
Use this button to specify advanced recognition settings in the Advanced OCR Recognition Settings window.
Output button
Use this button to open the output format window, where you select preferences for displaying PDF output.
Image cleanup
Select an image cleanup profile from the list.
Edit button
To modify an existing image cleanup profile or create a new one, use the Edit button to open the Image Cleanup Profiles window, where you specify the type of image cleanup to use, along with other advanced settings.
Delete button
Use this button to delete the currently selected profile. You cannot delete profiles that are built in to Kofax Capture.
Script button
If available, use this button to open the Recognition Script window, where you associate a recognition script with the selected recognition profile.
Test button
This button is unavailable for this recognition profile.