Create OCR Input plugin

This plugin is used for generating .png files corresponding to input files. These input files may be .tiff files or multipage .tiff files. These .png files are used for further processing and OCR. It uses ImageMagick for converting files to .png which will be used for OCR.

Configuration

Steps for configuring the plugin

Select the Page Process module and navigate to the CREATE_OCR_INPUT plugin configuration page.

Configurable properties

This plugin has no configurable properties.

Steps of execution

  • Plugin uses .tiff files as input.

  • While executing, ImageMagick parameters are used to generate the OCR, display .png thumbnail files and the .tiff files used for comparison and OCR.

  • These files are then copied to the batch instance folder and their respective entries are made into batch.xml file.

Dependency

The plugin assumes the incoming batch has been imported properly and batch.xml is created successfully.

Troubleshooting

The following table lists error messages that may occur if a plugin malfunction occurs.

Error message

Possible root cause

Problem in generating .png files

Some error occurred in generating .png files.

Improper Folder Specified folder name

Batch instance folder name is incorrect or does not exist. Make sure that the shared folder path is accurate.

Problem generating list of files

Batch instance folder name or path is incorrect.

Command cannot be run

ImageMagick is not working or ImageMagick configuration is not correct.