Automated entry system for printed documents

作者:

Highlights:

摘要

This paper proposes a system for automatically reading either Japanese or English documents that have complex layout structures that include graphics. First, document image segmentation and character segmentation are carried out using three basic features and the knowledge of document layout rules. Next, multi-font character recognition is performed based on feature vector matching. Recognition experiments with a prototype system for a variety of complex printed documents shows that the proposed system is capable of reading different types of printed documents at an accuracy rate of 94.8–97.2%.

论文关键词:Document entry system,Image processing,Document processing,Layout structure recognition,Character recognition,Feature extraction,Character segmentation

论文评审过程:Received 8 September 1989, Revised 30 January 1990, Accepted 19 February 1990, Available online 19 May 2003.

论文官网地址:https://doi.org/10.1016/0031-3203(90)90112-X