Edit distance-based kernel functions for structural pattern classification

作者:

Highlights:

摘要

A common approach in structural pattern classification is to define a dissimilarity measure on patterns and apply a distance-based nearest-neighbor classifier. In this paper, we introduce an alternative method for classification using kernel functions based on edit distance. The proposed approach is applicable to both string and graph representations of patterns. By means of the kernel functions introduced in this paper, string and graph classification can be performed in an implicit vector space using powerful statistical algorithms. The validity of the kernel method cannot be established for edit distance in general. However, by evaluating theoretical criteria we show that the kernel functions are nevertheless suitable for classification, and experiments on various string and graph datasets clearly demonstrate that nearest-neighbor classifiers can be outperformed by support vector machines using the proposed kernel functions.

论文关键词:String matching,Graph matching,Edit distance,Kernel methods,Support vector machine

论文评审过程:Received 9 February 2006, Accepted 6 April 2006, Available online 6 June 2006.

论文官网地址:https://doi.org/10.1016/j.patcog.2006.04.012