Ensemble of keyword extraction methods and classifiers in text classification

Authors:

Highlights:

• Text classification is a domain with a high-dimensional feature space.

• Extracting the keywords as the features can be extremely useful in text classification.

• An empirical analysis of five statistical keyword extraction methods.

• A comprehensive analysis of classifier and keyword extraction ensembles.

• On the ACM collection, a Bagging ensemble of Random Forest achieves a classification accuracy of 93.80%.
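
The idea behind the first two highlights, using extracted keywords as features to shrink a high-dimensional feature space, can be illustrated with a minimal sketch. It assumes a simple frequency-based extractor as a stand-in, since the highlights do not name the five statistical methods. The stopword list, the toy documents, and the function names (`tokenize`, `extract_keywords`, `to_feature_vector`) are all hypothetical, and no classifier or ensemble is built here.

```python
from collections import Counter
import re

# Tiny illustrative stopword list (an assumption, not taken from the paper).
STOPWORDS = {"a", "an", "the", "of", "in", "is", "and", "to", "for", "with"}


def tokenize(text):
    """Lowercase the text and keep alphabetic tokens that are not stopwords."""
    return [t for t in re.findall(r"[a-z]+", text.lower()) if t not in STOPWORDS]


def extract_keywords(documents, k):
    """Return the k most frequent terms in the collection.

    Ties are broken alphabetically so the result is deterministic.
    This frequency ranking is an assumed stand-in for a statistical extractor.
    """
    counts = Counter(tok for doc in documents for tok in tokenize(doc))
    ranked = sorted(counts.items(), key=lambda item: (-item[1], item[0]))
    return [term for term, _ in ranked[:k]]


def to_feature_vector(text, keywords):
    """Represent a document as keyword counts; its length is len(keywords)."""
    counts = Counter(tokenize(text))
    return [counts[kw] for kw in keywords]


# Toy documents, invented for illustration only.
docs = [
    "Keyword extraction for text classification",
    "Ensemble learning improves text classification accuracy",
    "Random forest is an ensemble of decision trees",
]

# Full bag-of-words vocabulary versus the reduced keyword feature set.
vocabulary = sorted({tok for doc in docs for tok in tokenize(doc)})
keywords = extract_keywords(docs, k=3)
vectors = [to_feature_vector(doc, keywords) for doc in docs]

print(len(vocabulary), "terms reduced to", len(keywords), "keyword features")
print(keywords)
print(vectors)
```

In this toy setting each document is described by 3 keyword counts instead of one dimension per distinct term. Vectors of this kind are what a classifier, or an ensemble of classifiers such as Bagging over Random Forest, would then be trained on.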

Keywords: Keyword extraction, Text classification, Ensemble learning, Scientific text classification

Review history: Received 4 January 2016, Revised 22 March 2016, Accepted 26 March 2016, Available online 29 March 2016, Version of Record 11 April 2016.

Official paper URL: https://doi.org/10.1016/j.eswa.2016.03.045