Notes on reading normal (non-image-only) PDF input
The conversion of PDF files in Tungsten OmniPage Capture SDK for Linux depends on the fonts installed on your system. Processing PDF files requires several font packages.
The generated image and thus the recognition result will be better with the following font packages installed:
-
msttcorefonts: This is a Windows essential fonts pack, which may have various names under different Linux distributions.
-
liberation-fonts: A fonts pack available under Red Hat distributions.
For SuSE systems, refer to the Optimal Use of MS TrueType Core Fonts for a KDE Desktop on SuSE article.
Installing font files is especially important for Asian PDF input. For more information, refer to the Multilingual support article.