Notes on reading normal (non-image-only) PDF input
The conversion of PDF files in Kofax OmniPage Capture SDK for Linux depends on the fonts installed on your system. Processing PDF files requires several font packages.
The generated image and thus the recognition result will be better with the following font packages installed:
-
msttcorefonts: This is a Windows essential fonts pack, which may have various names under different Linux distributions.
-
liberation-fonts: A fonts pack available under Red Hat distributions.
For SuSE systems, refer to the Optimal Use of MS TrueType Core Fonts for a KDE Desktop on SuSE article.
Installing font files is especially important for Asian PDF input. For a good starting point, study the following Wikipedia article: Help:Multilingual support (East Asian)