Automatic spelling correction using a trigram similarity measure

Authors:

Abstract

A nearest neighbour search procedure is described for the automatic correction of misspellings. The procedure involves the replacement of a misspelt word by that word in a dictionary which best matches the misspelling, the degree of match being calculated using a similarity coefficient based on the number of trigrams common to the two words. Experiments with a collection of 1544 misspellings and a dictionary of 64,636 words suggest that the procedure results in the unique identification of the correct spelling for over 75% of the misspellings if the correct form of the word is in the dictionary, and that this figure may be increased to over 90% if near, rather than nearest, neighbours are acceptable.
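The procedure described in the abstract can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: the abstract says only that the coefficient is based on the number of trigrams common to the two words, so the use of the Dice coefficient, the padding of each word with two `$` characters, and the names `trigrams`, `dice`, and `nearest` are all assumptions made for the example. The parameter `k` is a hypothetical way to return near, rather than only nearest, neighbours.

```python
from collections import Counter


def trigrams(word, pad="$"):
    """Return a multiset of the character trigrams of a padded, lowercased word.

    Padding with two marker characters on each side is an assumption;
    it lets the first and last letters contribute their own trigrams.
    """
    padded = pad * 2 + word.lower() + pad * 2
    return Counter(padded[i:i + 3] for i in range(len(padded) - 2))


def dice(a, b):
    """Dice coefficient on trigram multisets: 2 * common / (n_a + n_b).

    Assumed for illustration; the abstract does not name the coefficient.
    """
    ta, tb = trigrams(a), trigrams(b)
    common = sum((ta & tb).values())  # multiset intersection size
    total = sum(ta.values()) + sum(tb.values())
    return 2 * common / total if total else 0.0


def nearest(misspelling, dictionary, k=1):
    """Return the k dictionary words most similar to the misspelling.

    k=1 gives the nearest neighbour; k>1 gives a short list of near
    neighbours. This is a linear scan over the whole dictionary.
    """
    ranked = sorted(dictionary, key=lambda w: dice(misspelling, w), reverse=True)
    return ranked[:k]


if __name__ == "__main__":
    words = ["spelling", "spilling", "sapling", "selling"]
    print(nearest("speling", words))       # best single match
    print(nearest("speling", words, k=3))  # near neighbours
```

A linear scan like this is fine for a toy word list; for a dictionary of the size reported in the abstract, an inverted index from trigrams to words would be one way to avoid scoring every entry.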

Keywords:

Review history: Received 4 February 1982, Available online 13 July 2002.

Official paper URL: https://doi.org/10.1016/0306-4573(83)90022-5