Text line detection in handwritten documents

作者:

Highlights:

摘要

In this paper, we present a new text line detection method for handwritten documents. The proposed technique is based on a strategy that consists of three distinct steps. The first step includes image binarization and enhancement, connected component extraction, partitioning of the connected component domain into three spatial sub-domains and average character height estimation. In the second step, a block-based Hough transform is used for the detection of potential text lines while a third step is used to correct possible splitting, to detect text lines that the previous step did not reveal and, finally, to separate vertically connected characters and assign them to text lines. The performance evaluation of the proposed approach is based on a consistent and concrete evaluation methodology.

论文关键词:Document image analysis,Handwritten text,Hough transform,Text line detection,Connected component analysis

论文评审过程:Received 13 April 2007, Revised 26 March 2008, Accepted 2 May 2008, Available online 21 May 2008.

论文官网地址:https://doi.org/10.1016/j.patcog.2008.05.011