A comparative analysis of machine learning techniques for student retention management

作者:

摘要

Student retention is an essential part of many enrollment management systems. It affects university rankings, school reputation, and financial wellbeing. Student retention has become one of the most important priorities for decision makers in higher education institutions. Improving student retention starts with a thorough understanding of the reasons behind the attrition. Such an understanding is the basis for accurately predicting at-risk students and appropriately intervening to retain them. In this study, using five years of institutional data along with several data mining techniques (both individuals as well as ensembles), we developed analytical models to predict and to explain the reasons behind freshmen student attrition. The comparative analyses results showed that the ensembles performed better than individual models, while the balanced dataset produced better prediction results than the unbalanced dataset. The sensitivity analysis of the models revealed that the educational and financial variables are among the most important predictors of the phenomenon.

论文关键词:Retention management,Student attrition,Classification,Prediction,Machine learning,Sensitivity analysis

论文评审过程:Received 25 March 2010, Revised 5 May 2010, Accepted 12 June 2010, Available online 17 June 2010.

论文官网地址:https://doi.org/10.1016/j.dss.2010.06.003