Using the Straighten Columns Function

Images captured with document cameras commonly show two types of "positional" distortion: rotational distortion (skew) and page-curl distortion from books. TopOCR tries to correct these distortions with the following process:

  1. Perform a document layout analysis of the image and detect the location of columns of text, graphics and clipped columns of text near the edges of the document.


  2. Determine whether the image has a single column of text or multiple columns arranged side by side.


  3. If the image has a single column, straighten the entire image. If it has multiple side-by-side columns, try to straighten each column separately, since each column can have different distortion characteristics. This is typical for an image of an open book with two pages, each of which can have its own unique distortion.


  4. TopOCR straightens the columns of text and also removes graphics and clipped columns of text. The process is fast, generally taking under 0.25 seconds once an image has been binarized, and it can greatly improve OCR accuracy, in some cases by as much as 40% to 50%.
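
The decision made in steps 2 to 4 can be illustrated with a short Python sketch. This is not TopOCR's implementation: the `Region` type, its fields, and the `plan_straightening` function are hypothetical names invented for this example, and the layout analysis of step 1 is assumed to have already produced the list of regions.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class Region:
    """A hypothetical layout-analysis result (not a TopOCR data structure)."""
    kind: str       # "text" or "graphic"
    clipped: bool   # True if the region is cut off by the image edge
    x_left: int     # left edge in pixels, used only for left-to-right ordering


def plan_straightening(regions: List[Region]) -> Tuple[str, List[Region]]:
    """Decide how to straighten, following steps 2 to 4 in outline."""
    # Step 4: graphics and clipped columns are removed, so keep whole text columns only.
    columns = [r for r in regions if r.kind == "text" and not r.clipped]
    columns.sort(key=lambda r: r.x_left)

    if not columns:
        # Assumption: the text above does not describe this case.
        return "nothing_to_do", columns
    if len(columns) == 1:
        # Step 3, single column: straighten the entire image.
        return "whole_image", columns
    # Step 3, multiple columns: straighten each column separately.
    return "per_column", columns


# Example: an open book with two pages, a figure, and a clipped column at the edge.
book = [
    Region("text", False, 40),
    Region("graphic", False, 300),
    Region("text", False, 620),
    Region("text", True, 1180),
]
mode, kept = plan_straightening(book)
print(mode, [r.x_left for r in kept])  # prints: per_column [40, 620]
```

In this example the figure and the clipped column are dropped, and the two remaining page columns are each marked for separate straightening.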


Note: Straighten Columns does not work well on more extreme cases of cylindrical book curvature, so it is recommended that you use your hands to flatten the pages and reduce the curvature. The current version of this function also does not work well on images that have more than 12 degrees of skew.

Straighten Columns does not alter single lines or short columns of fewer than 4 lines of text.
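
If you estimate skew and count lines with your own tools before running the function, the limits in the notes above can be expressed as a simple pre-check. The following Python sketch is only an illustration: `straighten_warnings` and its parameters are hypothetical names, and the skew angle and per-column line counts are assumed to come from elsewhere, not from TopOCR.

```python
from typing import List

MAX_SKEW_DEGREES = 12.0  # limit stated in the note above
MIN_COLUMN_LINES = 4     # columns with fewer lines are left unchanged


def straighten_warnings(skew_degrees: float, lines_per_column: List[int]) -> List[str]:
    """Return human-readable warnings for an image about to be straightened."""
    warnings = []
    # Skew can be clockwise or counter-clockwise, so compare the magnitude.
    if abs(skew_degrees) > MAX_SKEW_DEGREES:
        warnings.append(
            f"skew of {abs(skew_degrees):.1f} degrees exceeds "
            f"{MAX_SKEW_DEGREES:.0f} degrees; results may be poor"
        )
    for index, lines in enumerate(lines_per_column):
        if lines < MIN_COLUMN_LINES:
            warnings.append(
                f"column {index} has {lines} line(s) and will be left unchanged"
            )
    return warnings


for message in straighten_warnings(15.0, [30, 2]):
    print(message)
```

An empty result means the image is within the stated limits; it does not guarantee a good result on strongly curved pages.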

Some examples of using the Straighten Columns function are given below as a sequence of input and output images: