A multiple classifier approach to detect Chinese character recognition errors

作者:

Highlights:

摘要

Detection of recognition errors is important in many areas, such as improving recognition performance, saving manual effort for proof-reading and post-editing, and assigning appropriate weights for retrieval in constructing digital libraries. We propose a novel application of multiple classifiers for the detection of recognition errors. A need for multiple classifiers emerges when a single classifier cannot improve recognition-error detection performance compared with the current detection scheme using a simple threshold mechanism. Although the single classifier does not improve recognition error performance, it serves as a baseline for comparison and the related study of useful features for error detection suggests three distinct cases where improvement is needed. For each case, the multiple classifier approach assigns a classifier to detect the presence or absence of errors and additional features are considered for each case. Our results show that the recall rate (70–80%) of recognition errors, the precision rate (80–90%) of recognition error detection and the saving in manual effort (75%) were better than the corresponding performance using a single classifier or a simple threshold detection scheme.

论文关键词:Character recognition,Error detection,Pattern recognition and language modeling

论文评审过程:Received 5 December 2003, Accepted 20 September 2004, Available online 15 December 2004.

论文官网地址:https://doi.org/10.1016/j.patcog.2004.09.004