Skew detection and reconstruction based on maximization of variance of transition-counts

作者:

Highlights:

摘要

The input document images with skew can be a serious problem in the optical character recognition system. A robust method is proposed in this paper for skew detection and reconstruction in document images which can contain less text areas, high noises, tables, figures, flow-chart, and photos. The basic idea of our approach is the maximization of variance of transition counts for the skew detection and text-orientation determination. Once the skew angle is determined, the scanning-line model is applied to reconstruct the skew images. 103 documents with great varieties have been tested and successfully processed. The average detection time of A4 size image is 4.86 s and the reconstruction time is 5.52 s. The proposed approach is also compared with the existing algorithms published in the literature and our method gets some significant improvements in skew detection and reconstruction.

论文关键词:Skew detection,Skew reconstruction,Scanning line,Transition-counts,Transition-counts variance,SNR,Scanning-line model

论文评审过程:Received 31 March 1997, Revised 30 April 1998, Accepted 12 January 1999, Available online 7 June 2001.

论文官网地址:https://doi.org/10.1016/S0031-3203(99)00045-X