Performance analysis of an OCR system via an artificial handwritten chinese character generator

作者:

Highlights:

摘要

A handwritten Chinese character generator that generates handwritten Chinese characters with different variations is proposed and used to evaluate the performance of an OCR system. The character generator first generates radicals of a character stroke by stroke and then combines the generated radicals to form a line-vector character. Next, the line-vector character is thickened to obtain a character image. Characters generated by this method will have great variance in shape but will still satisfy the structural constraints. The generated characters are then used to perform two types of evaluations. First, the stability of stroke extractors in terms of stroke number is evaluated. Two stroke extractors, one based on thinning and the other on vectorization, are analyzed. The vectorization-based stroke extractor we propose operates directly on the run length codes of line segments. From experimental results, we find that the thinning-based method is more time-consuming but more stable than the vectorization-based method. Second, the peak performance of a matching module is evaluated and the recognition error caused by stroke extractors is identified.

论文关键词:Handwritten Chinese character generator,Radical,Line-vector characters,Stroke extractor,B-spline functions

论文评审过程:Received 13 January 1993, Revised 13 August 1993, Accepted 23 August 1993, Available online 19 May 2003.

论文官网地址:https://doi.org/10.1016/0031-3203(94)90055-8