Close X

Image Window - Settings

Configure OCR Options... Dialog Settings

TopOCR OCR Document Type

Allows you to select between multi-column books or single column receipts. Receipt mode is also useful for retaining original layout. TopOCR is only utilized for TWAIN scanning and for reading multi-page PDF files.

Binarization Type

TopOCR uses different image processing algorithms depending on whether the image originated from a scanner or a camera. This setting allows you to specifiy either camera or scanner as the image source.

Camera Binarization

  1. Background Removal Slider - remove noise and objects like fingers from the document image background

  2. Contrast Equalize - equalizes background on more reflective images, such as a glossy magazine page

  3. Contrast Maximize - maximizes background contrast to produce better text binarization from darker backgrounds - do not use on lighter backgrounds

  4. Small Print - helps to enhance small text

  5. Straighten Columns - removes document skew and page curl from books and removes non text data

  6. Neural Warp - removes distortion from entire page

Tesseract OCR Engine Selection

Allows you to select which OCR classifier Tesseract will use for character recognition. Either the LSTM OCR engine or the TAO OCR engine can be selected.


Allows you to select the language for OCR recognition, the default is English.

Accessible Mode

Switch between the standard Windows GUI Mode and the Accessible User Interface Mode.