Efficient skew estimation and correction algorithm for document images

作者:

Highlights:

摘要

In this paper, we propose a fast skew estimation and correction algorithm for English and Korean documents based on a BAG (Block Adjacency Graph) representation. BAG is one of the most efficient data structures for extracting various information concerning connected components; the image rotation for skew correction is performed rapidly using the block information in the BAG. The proposed skew estimation algorithm uses a coarse/refine strategy based on the Hough transformation of connected components in the image. The skew correction algorithm then generates a non-skew image by rotating the blocks, rather than the individual pixels. An experiment using 2016 images from various English and Korean documents demonstrates how the proposed method is superior to conventional ones.

论文关键词:Skew estimation,Skew correction,BAG,Hough transform,Block rotation

论文评审过程:Received 11 May 2000, Revised 28 March 2001, Accepted 6 June 2001, Available online 4 October 2001.

论文官网地址:https://doi.org/10.1016/S0262-8856(01)00071-3