A robust and fast skew detection algorithm for generic documents

作者:

Highlights:

摘要

A robust and fast skew detection algorithm based on hierarchical Hough transform is proposed. It is capable of detecting the skew angle for various document images, including technical articles, postal labels, handwritten text, forms, drawings and bar codes. The algorithm is robust even when black margins introduced by photocopying are present in the image and when the document is scanned at a low resolution of 50 dpi. The algorithm consists of two steps. In the first step we quickly extract the centroids of connected components using a graph data structure. Then, a hierarchical Hough transform (at two different angular resolutions) is applied to the selected centroids. The skew angle corresponds to the location of the highest peak in the Hough space. The performance of the algorithm is shown on a number of document images collected from various application domains. The algorithm is not very sensitive to algorithmic parameters. For an A4 size document image scanned at 50 dpi (typically 413 × 575 pixels), our algorithm is able to detect the skew angle with an accuracy of 0.1° in 0.4s of CPU time on a SunSparc 20 workstation.

论文关键词:Skew detection,Document image processing,Hierarchical Hough transform,Block adjacency graph,Connected components

论文评审过程:Received 2 May 1995, Revised 2 October 1995, Accepted 20 February 1996, Available online 7 June 2001.

论文官网地址:https://doi.org/10.1016/0031-3203(96)00020-9