Forty years of research in character and document recognition—an industrial perspective

作者:

Highlights:

摘要

This paper presents an overview on the last 40-years of technical advances in the field of character and document recognition. Representative developments in each decade are described. Then, key technical developments in the specific area of Kanji recognition in Japan are highlighted. The main part of the paper discusses robustness design principles, which have proven to be effective to solve complex problems in postal address recognition. Included are the hypothesis-driven principle, deferred decision/multiple-hypotheses principle, information integration principle, alternative solution principle, and perturbation principle. Finally, future prospects, the ‘long-tail’ phenomena, and promising new applications are discussed.

论文关键词:OCR,Character recognition,Handwriting recognition,Kanji recognition,Postal address recognition,Robustness design,Information integration,Hypothesis-driven approaches,Digital pen

论文评审过程:Received 15 February 2008, Revised 10 March 2008, Accepted 11 March 2008, Available online 27 March 2008.

论文官网地址:https://doi.org/10.1016/j.patcog.2008.03.015