Improving classifier training efficiency for automatic cyberbullying detection with Feature Density

作者:

Highlights:

• Feature Density can be utilized to reduce of the number of required experiments iterations.

• In general, Feature Density seems to have a negative correlation with classifier performance.

• Dependency structures could have potential as features in Neural Networks.

• Dataset complexity cannot be measured with Feature Density alone.

• Linguistic preprocessing can improve classifier performance.

摘要

•Feature Density can be utilized to reduce of the number of required experiments iterations.•In general, Feature Density seems to have a negative correlation with classifier performance.•Dependency structures could have potential as features in Neural Networks.•Dataset complexity cannot be measured with Feature Density alone.•Linguistic preprocessing can improve classifier performance.

论文关键词:Feature density,Dataset complexity,Linguistics,Cyberbullying,Document classification,Preprocessing

论文评审过程:Received 31 October 2020, Revised 6 April 2021, Accepted 28 April 2021, Available online 13 May 2021, Version of Record 13 May 2021.

论文官网地址:https://doi.org/10.1016/j.ipm.2021.102616