A review of segmentation and contextual analysis techniques for text recognition

作者:

Highlights:

摘要

This paper presents a review of the literature on text-processing techniques. The techniques covered are segmentation of text and contextual recognition, both of which are required when considering text recognition of documents. Several different methods of text segmentation are compared for various image data formats. The problem of correct segmentation of joined and broken characters is also considered. Two techniques for contextual recognition are considered: the Markov-based methods and dictionary look-up methods. The various techniques for storing dictionary information are compared. A discussion of the importance of the choice of the correct context is given, together with guidance on which methods are best suited to which applications.

论文关键词:Text processing,Text segmentation,Text recognition,Markov methods,Levenshtein Distance N-gram techniques,Dictionary structure,Viterbi Algorithm,Contextual processing

论文评审过程:Received 23 January 1989, Revised 27 April 1989, Accepted 16 May 1989, Available online 19 May 2003.

论文官网地址:https://doi.org/10.1016/0031-3203(90)90021-C