On the use of data filtering techniques for credit risk prediction with instance-based models

Authors:


Abstract

Many techniques have been proposed for credit risk prediction, from statistical models to artificial intelligence methods. However, very few research efforts have been devoted to dealing with noise and outliers in the training set, which may strongly affect the performance of the prediction model. Accordingly, the aim of the present paper is to systematically investigate whether applying filtering algorithms increases the accuracy of instance-based classifiers in the context of credit risk assessment. The experimental results with 20 different algorithms and 8 credit databases show that the filtered sets perform significantly better than the non-preprocessed training sets when using the nearest neighbour decision rule. The experiments also make it possible to identify which techniques are most robust and accurate when confronted with noisy credit data.
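
The abstract does not name the 20 filtering algorithms, so the following is only an illustrative sketch of what filtering a training set for a nearest neighbour classifier can look like. It uses Wilson's editing, a classic filter that discards every instance whose label disagrees with the majority of its k nearest neighbours; whether this exact method is among those evaluated in the paper is an assumption. The function names, the value of k, and the toy data are hypothetical and are not taken from the paper.

```python
from collections import Counter
import math


def knn_predict(train_X, train_y, query, k=3, skip_index=None):
    """Majority vote among the k nearest training points (Euclidean distance)."""
    distances = []
    for i, (x, label) in enumerate(zip(train_X, train_y)):
        if i == skip_index:  # leave the query instance itself out of the vote
            continue
        distances.append((math.dist(x, query), label))
    distances.sort(key=lambda pair: pair[0])
    votes = Counter(label for _, label in distances[:k])
    return votes.most_common(1)[0][0]


def wilson_edit(X, y, k=3):
    """Keep only instances whose label agrees with their k nearest neighbours."""
    keep = [i for i in range(len(X))
            if knn_predict(X, y, X[i], k=k, skip_index=i) == y[i]]
    return [X[i] for i in keep], [y[i] for i in keep]


# Made-up toy data: two clusters, plus one "bad" label placed inside the
# "good" cluster to play the role of a noisy instance.
X = [(0.0, 0.0), (0.1, 0.2), (0.2, 0.1), (0.15, 0.15),
     (1.0, 1.0), (1.1, 0.9), (0.9, 1.1)]
y = ["good", "good", "good", "bad", "bad", "bad", "bad"]

X_filtered, y_filtered = wilson_edit(X, y, k=3)

# Classify a new applicant against the filtered set with the 1-NN rule.
prediction = knn_predict(X_filtered, y_filtered, (0.14, 0.16), k=1)
```

On this toy set the filter drops the mislabelled point at (0.15, 0.15), so a query next to it is classified by the surrounding "good" instances rather than by the noisy one. With the unfiltered set, the same 1-NN query would be assigned the noisy label.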

Keywords: Finance, Credit risk, Instance selection, Outlier, Filtering, Editing, Nearest neighbour rule

Publication history: Available online 7 June 2012.

Paper URL: https://doi.org/10.1016/j.eswa.2012.05.075