Representations

A representation is the result of an OCR conversion of the source file. More than one full text conversion is possible and can be stored under representations. This object may have the following properties:

Pages

Each page in the representation is listed as created from the original source document. Normally there is a 1:1 relationship between XDoc pages and CDoc pages.

Text Lines

Text lines consisting of Words generated from OCR results.

Words

Indexed list of Words with characters, bounding rectangles and additional properties as generated from character recognition.