Feature selection using Bayesian and multiclass Support Vector Machines approaches: Application to bank risk prediction

作者:

Highlights:

摘要

This paper presents methods of banks discrimination according to the rate of NonPerforming Loans (NPLs), using Gaussian Bayes models and different approaches of multiclass Support Vector Machines (SVM). This classification problem involves many irrelevant variables and comparatively few training instances. New variable selection strategies are proposed. They are based on Gaussian marginal densities for Bayesian models and ranking scores derived from multiclass SVM. The results on both toy data and real-life problem of banks classification demonstrate a significant improvement of prediction performance using only a few variables. Moreover, Support Vector Machines approaches are shown to be superior to Gaussian Bayes models.

论文关键词:Multiclass bank’s risk,Gaussian Bayes classifier,Multiclass SVM,Variable selection,Stepwise algorithm,Risk factors

论文评审过程:Available online 6 September 2011.

论文官网地址:https://doi.org/10.1016/j.eswa.2011.08.172