Code sample for handling large volume output
If several hundred pages need to be converted into a single PDF output, the
straightforward workflow may need more system resources than are available. To
reduce resource usage, group the input data into smaller chunks, process each
chunk, and append the result to the output. The input data can be any
single-page or multi-page image file, or a set of files.
| Output type | Output file type | Recommended processing size |
| --- | --- | --- |
| Direct TXT output | DTXT_PDFIOT and DTXT_IOTPDF_MRC | 5 to 10 pages |
| Layout retention output (Formatted output) | PDF, PDFEdited, PDFImageOnText, and PDFImageSubst | 50 to 100 pages |
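The recommended processing sizes determine how many append operations a job needs. The following is a minimal sketch of that arithmetic, not part of the SDK: `chunk_count` and `last_chunk_size` are hypothetical helper names introduced here for illustration, and the page counts in the note after the block are example values only.

```c
/* Hypothetical helpers (not SDK functions) for sizing a chunked conversion. */

/* Number of chunks needed to cover `pages` pages at `pack` pages per chunk.
   Uses ceiling division; returns 0 for non-positive input. */
int chunk_count(int pages, int pack)
{
    if (pages <= 0 || pack <= 0)
        return 0;
    return (pages + pack - 1) / pack;
}

/* Size of the final chunk, which may be shorter than `pack`.
   Returns 0 for non-positive input. */
int last_chunk_size(int pages, int pack)
{
    if (pages <= 0 || pack <= 0)
        return 0;
    int rem = pages % pack;
    return rem == 0 ? pack : rem;
}
```

For example, 730 pages processed 100 at a time (the upper end of the layout retention range) give `chunk_count(730, 100) == 8`, with 30 pages in the last chunk.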
RecSetOutputFormat(sid, L"Converters.Text.PDFImageOnText");
// Set Converters.Text.PDFImageOnText.AppendFrom to the output file name
// so that each converted chunk is appended to the output.
kRecSettingGetHandle(NULL, "Converters.Text.PDFImageOnText.AppendFrom", &hset, NULL);
kRecSettingSetUString(sid, hset, outpdf);
for (int i = 0; i < pages; i++) {
    int ind = i % PACK;  // cache index within the current chunk
    if (ind == 0)        // first page of a chunk: start a new document
        RecCreateDoc(sid, L"", &hDoc, DOC_NORMAL);
    kRecLoadImg(sid, hFile, &hPages[ind], i);
    kRecPreprocessImg(sid, hPages[ind]);
    kRecRecognize(sid, hPages[ind], NULL);
    RecInsertPage(sid, hDoc, hPages[ind], -1);
    if (ind + 1 == PACK || i + 1 == pages) {  // chunk full or last page: append cached pages
        RecConvert2Doc(sid, hDoc, outpdf);
        RecCloseDoc(sid, hDoc);
    }
}
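The chunking condition in the loop above can be checked without the SDK. The following is a minimal sketch that replays only the index logic (`i % PACK` and the `ind+1 == PACK || i+1 == pages` test) and records where an append would happen. `plan_flushes` and its parameters are hypothetical names introduced for illustration; no SDK calls are made.

```c
#include <stddef.h>

/* Hypothetical dry run of the sample's chunking logic (no SDK calls).
   Stores in flush_at[] the 0-based index of every page after which the
   cached pages would be appended to the output. Returns the number of
   appends, or -1 if the arguments are invalid or flush_at[] is too small. */
int plan_flushes(int pages, int pack, int *flush_at, size_t capacity)
{
    if (pages < 0 || pack <= 0 || flush_at == NULL)
        return -1;
    size_t n = 0;
    for (int i = 0; i < pages; i++) {
        int ind = i % pack;                      /* position inside the current chunk */
        if (ind + 1 == pack || i + 1 == pages) { /* chunk full, or last page reached */
            if (n == capacity)
                return -1;
            flush_at[n++] = i;
        }
    }
    return (int)n;
}
```

With 7 pages and a chunk size of 3, the function reports three appends, after page indexes 2, 5, and 6. The second half of the condition is what ensures the final, shorter chunk is still written.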