InstanceRank based on borders for instance selection

作者:

Highlights:

摘要

Instance selection algorithms are used for reducing the number of training instances. However, most of them suffer from long runtimes which results in the incapability to be used with large datasets. In this work, we introduce an Instance Ranking per class using Borders (instances near to instances belonging to different classes), using this ranking we propose an instance selection algorithm (IRB). We evaluated the proposed algorithm using k-NN with small and large datasets, comparing it against state of the art instance selection algorithms. In our experiments, for large datasets IRB has the best compromise between time and accuracy. We also tested our algorithm using SVM, LWLR and C4.5 classifiers, in all cases the selection computed by our algorithm obtained the best accuracies in average.

论文关键词:Instance selection,Instance ranking,Border instances,Supervised classification

论文评审过程:Received 2 November 2011, Revised 4 June 2012, Accepted 14 July 2012, Available online 30 July 2012.

论文官网地址:https://doi.org/10.1016/j.patcog.2012.07.007