Text line and word segmentation of handwritten documents

作者:

Highlights:

摘要

In this paper, we present a segmentation methodology of handwritten documents in their distinct entities, namely, text lines and words. Text line segmentation is achieved by applying Hough transform on a subset of the document image connected components. A post-processing step includes the correction of possible false alarms, the detection of text lines that Hough transform failed to create and finally the efficient separation of vertically connected characters using a novel method based on skeletonization. Word segmentation is addressed as a two class problem. The distances between adjacent overlapped components in a text line are calculated using the combination of two distance metrics and each of them is categorized either as an inter- or an intra-word distance in a Gaussian mixture modeling framework. The performance of the proposed methodology is based on a consistent and concrete evaluation methodology that uses suitable performance measures in order to compare the text line segmentation and word segmentation results against the corresponding ground truth annotation. The efficiency of the proposed methodology is demonstrated by experimentation conducted on two different datasets: (a) on the test set of the ICDAR2007 handwriting segmentation competition and (b) on a set of historical handwritten documents.

论文关键词:Handwritten document image analysis,Hough transform,Text line segmentation,Word segmentation,Gaussian mixture modeling

论文评审过程:Received 8 August 2008, Revised 12 November 2008, Accepted 21 December 2008, Available online 4 January 2009.

论文官网地址:https://doi.org/10.1016/j.patcog.2008.12.016