File format and compression format combinations

The supported file format / compression format combinations are listed here. The list of available combinations may vary, based on the application used with the export connector.

  • Multipage TIFF - CCITT Group 4

  • Multipage TIFF - CCITT Group 3

  • Multipage TIFF - Uncompressed

  • Multipage TIFF - CCITT Group 3/2D

  • Multipage TIFF - JPEG Compression

  • TIFF - CCITT Group 4

  • TIFF - CCITT Group 3

  • TIFF - Uncompressed

  • TIFF - CCITT Group 3/2D

  • TIFF - JPEG Compression

  • PCX - PackBytes

  • JPG - JPEG Compression

  • Multipage TIFF - LZW Compression

  • Tungsten PDF

If the batch contains a mix of color, grayscale, and bitonal images, select a file format combination that uses the TIFF file format with any compression other than JPEG compression. The exported images have the ".TIF" file extension. However, any color images in the batch are stored as JPEG-compressed data within the TIFF file.
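The mixed-batch rule above can be sketched as a filter over the listed combinations. This is an illustration only, not part of the export connector: the names COMBINATIONS and suitable_for_mixed_batch are hypothetical, and the sketch assumes that "the TIFF file format" covers both TIFF and Multipage TIFF.

```python
# The combinations listed above, as (file format, compression) pairs.
# Tungsten PDF has no separate compression entry in the list.
COMBINATIONS = [
    ("Multipage TIFF", "CCITT Group 4"),
    ("Multipage TIFF", "CCITT Group 3"),
    ("Multipage TIFF", "Uncompressed"),
    ("Multipage TIFF", "CCITT Group 3/2D"),
    ("Multipage TIFF", "JPEG Compression"),
    ("TIFF", "CCITT Group 4"),
    ("TIFF", "CCITT Group 3"),
    ("TIFF", "Uncompressed"),
    ("TIFF", "CCITT Group 3/2D"),
    ("TIFF", "JPEG Compression"),
    ("PCX", "PackBytes"),
    ("JPG", "JPEG Compression"),
    ("Multipage TIFF", "LZW Compression"),
    ("Tungsten PDF", None),
]


def suitable_for_mixed_batch(file_format, compression):
    """Return True for a TIFF combination that does not use JPEG compression.

    Assumption: Multipage TIFF counts as the TIFF file format here.
    """
    is_tiff = file_format in ("TIFF", "Multipage TIFF")
    return is_tiff and compression != "JPEG Compression"


# Combinations that satisfy the rule for batches with mixed image types.
MIXED_BATCH_CHOICES = [c for c in COMBINATIONS if suitable_for_mixed_batch(*c)]
```

The actual list offered during setup may be shorter, because the available combinations vary with the application used with the export connector.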

Notes for exporting image, OCR full text, eDocument, and combination files

The following tables describe how documents are exported to the IBM Content Manager root destination, depending on the following:

  • Which image format is selected

  • Whether the Tungsten Capture "Allow import of eDocument files" option is enabled, or whether eDocuments have been imported into Tungsten TotalAgility

  • Whether OCR full text files are generated

The export connector can export OCR as part of the image (image stored as the base), or export OCR as the base (image is discarded). Set your preference on the General Settings tab during export connector setup.

To create OCR full text files:

  • Tungsten Capture: Add the OCR Full Text module to the batch class and select the Enable OCR full text check box in the document class.

  • Tungsten TotalAgility: Not supported.

  • Tungsten Express (formerly Kofax Express): Select Job Setup > Export Setup > PDF Setup > OCR tab, or select Job Setup > PDF Options > PDF Setup > OCR tab.

In IBM Content Manager, the following occurs for each document:

  • One document is exported for each multiple-image document, OCR full text file, eDocument, and/or Tungsten PDF document.

  • The documents are exported in the order described in the following tables.

Export images and eDocuments

The result depends on the image format and on the OCR option selected on the General Settings tab.

Image Format: Multipage TIFF

All Images

  • "OCR Full Text -None": The result is one document that consists of all images.

  • "Export OCR as part of the image (image stored as base)": The result is one document that consists of all images with OCR content as the last page (ICMBASETEXT).

  • "Export OCR as base (image discarded)": The result is one document that consists of OCR content (ICMBASETEXT).

All eDocuments

  • "OCR Full Text -None": The result is one document that consists of all eDocuments.

  • "Export OCR as part of the image (image stored as base)": The result is one document that consists of all eDocuments with OCR content as the last page (ICMBASETEXT).

  • "Export OCR as base (image discarded)": The result is one document with OCR content (ICMBASETEXT) on the first page, followed by content for eDocuments on subsequent pages.

Image Format: Tungsten PDF

All Images

  • "OCR Full Text -None": The result is one document with PDF content.

  • "Export OCR as part of the image (image stored as base)": The result is one document with PDF content followed by OCR content on the last page (ICMBASETEXT).

  • "Export OCR as base (image discarded)": The result is one document that consists of OCR content (ICMBASETEXT).

All eDocuments

  • "OCR Full Text -None": The result is one document with PDF content on the first page, with eDocument content on subsequent pages.

  • "Export OCR as part of the image (image stored as base)": The result is one document with PDF content followed by eDocument content, and OCR content on the last page (ICMBASETEXT).

  • "Export OCR as base (image discarded)": The result is one document that consists of OCR content (ICMBASETEXT).
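For quick reference, the "Export images and eDocuments" results above can be written as a lookup table. This is only a restatement of the documented outcomes for illustration. The names EXPORT_RESULTS and export_parts, and the short keys "none", "part_of_image", and "as_base", are hypothetical and are not connector settings or API names.

```python
OCR = "OCR content (ICMBASETEXT)"

# Key: (image format, batch content, OCR option on the General Settings tab).
# Value: the parts of the single exported document, in page order.
EXPORT_RESULTS = {
    ("Multipage TIFF", "images", "none"): ["all images"],
    ("Multipage TIFF", "images", "part_of_image"): ["all images", OCR],
    ("Multipage TIFF", "images", "as_base"): [OCR],
    ("Multipage TIFF", "eDocuments", "none"): ["all eDocuments"],
    ("Multipage TIFF", "eDocuments", "part_of_image"): ["all eDocuments", OCR],
    ("Multipage TIFF", "eDocuments", "as_base"): [OCR, "eDocument content"],
    ("Tungsten PDF", "images", "none"): ["PDF content"],
    ("Tungsten PDF", "images", "part_of_image"): ["PDF content", OCR],
    ("Tungsten PDF", "images", "as_base"): [OCR],
    ("Tungsten PDF", "eDocuments", "none"): ["PDF content", "eDocument content"],
    ("Tungsten PDF", "eDocuments", "part_of_image"): [
        "PDF content",
        "eDocument content",
        OCR,
    ],
    ("Tungsten PDF", "eDocuments", "as_base"): [OCR],
}


def export_parts(image_format, content, ocr_option):
    """Look up the ordered parts of the exported document for one table cell."""
    return EXPORT_RESULTS[(image_format, content, ocr_option)]
```

For example, export_parts("Multipage TIFF", "eDocuments", "as_base") returns the OCR content first, followed by the eDocument content, which matches the corresponding table entry.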

Export mixed images and documents (Tungsten Capture only)

The result depends on the image format and on the OCR option selected on the General Settings tab.

Image Format: Multipage TIFF and eDocuments

  • "OCR Full Text -None": The result is one document that consists of all images and all eDocuments.

  • "Export OCR as part of the image (image stored as base)": The result is one document that consists of all images and all eDocuments with OCR content as the last page (ICMBASETEXT).

  • "Export OCR as base (image discarded)": The result is one document with OCR content (ICMBASETEXT) on the first page, followed by content for eDocuments on subsequent pages.

Image Format: Tungsten PDF

  • "OCR Full Text -None": The result is one document with PDF content on the first page, with eDocument content on subsequent pages.

  • "Export OCR as part of the image (image stored as base)": The result is one document with PDF content followed by eDocument content, and OCR content on the last page (ICMBASETEXT).

  • "Export OCR as base (image discarded)": The result is one document that consists of OCR content (ICMBASETEXT) on the first page, followed by content for all eDocuments on subsequent pages.
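The mixed-export results above follow one pattern for both image formats, which the following sketch captures as a small function. It is an illustration under stated assumptions, not connector code: the function name mixed_export_parts and the option keys "none", "part_of_image", and "as_base" are hypothetical.

```python
OCR = "OCR content (ICMBASETEXT)"


def mixed_export_parts(image_format, ocr_option):
    """Return the ordered parts of the exported document for a mixed batch.

    image_format: "Multipage TIFF" or "Tungsten PDF"
    ocr_option:   "none", "part_of_image", or "as_base" (hypothetical keys
                  standing in for the General Settings tab options)
    """
    if image_format not in ("Multipage TIFF", "Tungsten PDF"):
        raise ValueError(f"unsupported image format: {image_format}")
    if ocr_option not in ("none", "part_of_image", "as_base"):
        raise ValueError(f"unknown OCR option: {ocr_option}")

    primary = "all images" if image_format == "Multipage TIFF" else "PDF content"
    edocs = "eDocument content"

    if ocr_option == "none":
        # Images or PDF content first, then the eDocuments.
        return [primary, edocs]
    if ocr_option == "part_of_image":
        # Same order, with the OCR content appended as the last page.
        return [primary, edocs, OCR]
    # "as_base": the image or PDF content is discarded, OCR comes first.
    return [OCR, edocs]
```

Note that with OCR exported as the base, the eDocument content is kept in the mixed case, whereas the image or PDF content is discarded.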