Computer recognition of printed Tamil characters

Authors:

Abstract

Computer recognition of machine-printed letters of the Tamil alphabet is described. Each character is represented as a binary matrix and encoded into a string using two different methods. The encoded strings form a dictionary. A given text is presented symbol by symbol; information from each symbol is extracted in the form of a string and compared with the strings in the dictionary. When a match is found, the letter is recognized and printed out in Roman letters following a special method of transliteration. The lengthening of vowels and hardening of consonants are indicated by numerals printed above each letter.
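
The encode-and-look-up pipeline described in the abstract can be illustrated with a minimal sketch. The abstract does not specify the two encoding methods, so the per-row run-length scheme below is a stand-in assumption, not the authors' encoding. The names `encode_rows`, `build_dictionary`, and `recognize` are hypothetical, and the 3x3 templates are toy patterns rather than real Tamil glyphs.

```python
from typing import Dict, List, Optional

Matrix = List[List[int]]  # binary matrix: 1 = ink, 0 = background


def encode_rows(matrix: Matrix) -> str:
    """Encode a binary matrix as a string of per-row run lengths.

    Assumed illustrative scheme (not the paper's): each non-empty row
    becomes comma-separated "<bit>x<run length>" tokens, rows joined by "/".
    """
    parts = []
    for row in matrix:
        runs = []
        current, count = row[0], 0
        for bit in row:
            if bit == current:
                count += 1
            else:
                runs.append(f"{current}x{count}")
                current, count = bit, 1
        runs.append(f"{current}x{count}")
        parts.append(",".join(runs))
    return "/".join(parts)


def build_dictionary(templates: Dict[str, Matrix]) -> Dict[str, str]:
    """Map each encoded template string to its Roman-letter label."""
    return {encode_rows(m): label for label, m in templates.items()}


def recognize(symbol: Matrix, dictionary: Dict[str, str]) -> Optional[str]:
    """Return the label whose stored string equals the symbol's string, else None."""
    return dictionary.get(encode_rows(symbol))


# Toy templates (hypothetical; real glyph matrices would be much larger).
TEMPLATES: Dict[str, Matrix] = {
    "a": [[0, 1, 0], [1, 1, 1], [0, 1, 0]],
    "k": [[1, 0, 1], [1, 1, 0], [1, 0, 1]],
}
DICTIONARY = build_dictionary(TEMPLATES)
```

Calling `recognize(TEMPLATES["k"], DICTIONARY)` returns `"k"`, while a symbol whose encoded string is absent from the dictionary returns `None`. Exact string equality is the simplest form of matching; it tolerates no noise, which is one reason a practical system would need more robust encodings than this sketch.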

Keywords: Character recognition, Template matching, Feature encoding, Tamil alphabet, Condensing procedure

Review history: Received 3 January 1978, Available online 19 May 2003.

Official paper URL: https://doi.org/10.1016/0031-3203(78)90032-8