Handwritten character recognition through two-stage foreground sub-sampling

Abstract

In this paper, we present a methodology for off-line handwritten character recognition. The methodology relies on a new feature extraction technique that recursively subdivides the character image so that, at each iteration, the resulting sub-images contain balanced (approximately equal) numbers of foreground pixels, as far as this is possible. Feature extraction is followed by a two-stage classification scheme based on the level of granularity of the feature extraction method. Classes with high mutual values in the confusion matrix are merged at a certain level, and for each group of merged classes, granularity features from the level that best distinguishes them are employed. Two handwritten character databases (CEDAR and CIL) and two handwritten digit databases (MNIST and CEDAR) were used to demonstrate the effectiveness of the proposed technique. Compared with results reported in the literature, the recognition rate achieved is the highest for the well-known CEDAR Character Database (94.73%) and among the best for the MNIST Database (99.03%).
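
The recursive subdivision described in the abstract can be illustrated with a minimal Python sketch. It rests on assumptions the abstract does not confirm: that each sub-image is split into four quadrants at a single division point, that this point is chosen independently along each axis by balancing the row and column sums of foreground pixels, and that the division-point coordinates are what gets collected per level. The names `balance_point` and `division_points` are hypothetical and not taken from the paper.

```python
import numpy as np

def balance_point(counts):
    """Return the split index k that makes sum(counts[:k]) and
    sum(counts[k:]) as close as possible (first such k on ties)."""
    total = int(counts.sum())
    if total == 0:
        # No foreground: fall back to the geometric midpoint (assumption).
        return len(counts) // 2
    # left[k] = number of foreground pixels before index k, for k = 0..len
    left = np.concatenate(([0], np.cumsum(counts)))
    return int(np.argmin(np.abs(2 * left - total)))

def division_points(img, levels, r0=0, c0=0):
    """Recursively split a binary image (1 = foreground) into quadrants
    with balanced foreground counts; return the division points as
    absolute (row, col) pairs, parent first, then its four quadrants."""
    if levels == 0:
        return []
    r = balance_point(img.sum(axis=1))  # balance rows (top vs bottom)
    c = balance_point(img.sum(axis=0))  # balance columns (left vs right)
    points = [(r0 + r, c0 + c)]
    quadrants = (
        (img[:r, :c], r0, c0),          # top-left
        (img[:r, c:], r0, c0 + c),      # top-right
        (img[r:, :c], r0 + r, c0),      # bottom-left
        (img[r:, c:], r0 + r, c0 + c),  # bottom-right
    )
    for sub, dr, dc in quadrants:
        points += division_points(sub, levels - 1, dr, dc)
    return points

if __name__ == "__main__":
    glyph = np.ones((4, 4), dtype=int)  # toy "character": all foreground
    print(division_points(glyph, levels=2))
```

Under these assumptions, a subdivision of depth L yields 1 + 4 + ... + 4^(L-1) division points, so a deeper level gives a finer-grained description of the same character. How the paper turns these subdivisions into feature vectors, and how it picks the level for each group of merged classes, is not specified in the abstract and is not modelled here.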

Keywords: Handwritten character/digit recognition, Feature extraction, Two-stage classification

Review history: Received 10 February 2009, Revised 11 February 2010, Accepted 23 February 2010, Available online 3 March 2010.

Official paper URL: https://doi.org/10.1016/j.patcog.2010.02.018