Learning stochastic edit distance: Application in handwritten character recognition

作者:

Highlights:

摘要

Many pattern recognition algorithms are based on the nearest-neighbour search and use the well-known edit distance, for which the primitive edit costs are usually fixed in advance. In this article, we aim at learning an unbiased stochastic edit distance in the form of a finite-state transducer from a corpus of (input, output) pairs of strings. Contrary to the other standard methods, which generally use the Expectation Maximisation algorithm, our algorithm learns a transducer independently on the marginal probability distribution of the input strings. Such an unbiased way to proceed requires to optimise the parameters of a conditional transducer instead of a joint one. We apply our new model in the context of handwritten digit recognition. We show, carrying out a large series of experiments, that it always outperforms the standard edit distance.

论文关键词:Stochastic edit distance,Finite-state transducers,Handwritten character recognition

论文评审过程:Received 2 August 2005, Revised 15 March 2006, Accepted 17 March 2006, Available online 5 May 2006.

论文官网地址:https://doi.org/10.1016/j.patcog.2006.03.011