TableXTract table recognition and data extraction tool

CSDK offers the TableXTract tool for table recognition and data extraction to XML. While the table recognition module is integral part of CSDK, TableXTract is a command line tool, offering layout detection for accustomed multipage tables. It tolerates formatting and various merged cell structures.

Note that tables and forms require different processes in CSDK. If you work with forms, see Form recognition module.

For detailed instructions, see Process a document with TableXTract.