Off-line cursive script recognition: current advances, comparisons and remaining problems

作者:Amjad Rehman, Tanzila Saba

摘要

This paper presents detailed review in the field of off-line cursive script recognition. Various methods are analyzed that have been proposed to realize the core of script recognition in a word recognition system. These methods are discussed in view of the two most important properties of such systems: size and nature of the lexicon involved and whether or not a segmentation stage is present. Script recognition techniques are classified into three categories: firstly, segmentation-free methods or holistic approaches, that compare a sequence of observations derived from whole word image with similar references of words in the small lexicon. Secondly, segmentation-based methods, that look for the best match between consecutive sequences of primitive segments and letters of a possible word similar to human-like reading technique, in which secure features found all over the word are used to boot-strap a few candidates for a final evaluation phase; thirdly, hybrid approaches. Additionally, different feature extraction techniques are elaborated in conjunction with the classification process. In this scenario, implications of single and multiple classifiers are also observed. Finally, remaining problems are highlighted along with possible suggestion and strategies to solve them.

论文关键词:Script recognition, Character segmentation, Character recognition, Feature extraction, Holistic approaches

论文评审过程:

论文官网地址:https://doi.org/10.1007/s10462-011-9229-7