Detection of financial statement fraud and feature selection using data mining techniques

作者:

Highlights:

摘要

Recently, high profile cases of financial statement fraud have been dominating the news. This paper uses data mining techniques such as Multilayer Feed Forward Neural Network (MLFF), Support Vector Machines (SVM), Genetic Programming (GP), Group Method of Data Handling (GMDH), Logistic Regression (LR), and Probabilistic Neural Network (PNN) to identify companies that resort to financial statement fraud. Each of these techniques is tested on a dataset involving 202 Chinese companies and compared with and without feature selection. PNN outperformed all the techniques without feature selection, and GP and PNN outperformed others with feature selection and with marginally equal accuracies.

论文关键词:Data mining,Financial fraud detection,Feature selection,t-statistic,Neural networks,SVM,GP

论文评审过程:Received 20 November 2009, Revised 14 June 2010, Accepted 3 November 2010, Available online 12 November 2010.

论文官网地址:https://doi.org/10.1016/j.dss.2010.11.006