Extract document types
You can define fields and extract data from document types.
-
Select
Yes using the toggle button for
Define Fields and teach the system to extract data from your document types.
The configured document types are displayed.
-
Click the document for which you want to define the fields.
The selected document appears in the middle pane.
-
Lasso the fields you want to extract from the documents using the mouse.
The Field Extraction Training dialog box is displayed.
-
Configure the field properties.
- Provide a Name for the field.
- On the Type list, select Text, Date or Number.
- On the Formatter list, select the formatter.
-
Define the following validation rules as applicable for the field:
-
Is mandatory: Select if you want the field to be mandatory.
-
Minimum character length: Provide the minimum character length.
-
Maximum character length: Provide the maximum character length.
-
Defined allowed characters: Define the characters you want to allow for the field.
-
Defined restricted characters: Define the characters you want to restrict for the field.
Note-
The fields that are configured appear in green. When you hover your mouse on the field, a message pops up indicating that it has been trained and whether it has any conflicts.
-
Configuring the field types and validation rules help the system look at the ones that do not match your rules. You can also check the fields that can be extracted on the documents that are not trained.
-
You should show the system three instances of each field in each document type unless it is already able to find the field on each sample after viewing it only once or twice.
-
- Repeat the same process for other documents in your document types.
- Click Save.