The impact of deep learning on document classification using semantically rich representations

作者:

Highlights:

• Provides a novel document representation model enriched with semantical information.

• Make use of the background knowledge extracted from an ontology for incorporating semantics.

• An in-depth analysis and accuracy measurement of document classification using multiple deep learning architecture configurations.

• A comprehensive comparative evaluation of conventional machine learning and deep learning techniques on document classification.

• Proposed deep learning techniques outperform the conventional ones, achieving better accuracy in every point of testing.

摘要

•Provides a novel document representation model enriched with semantical information.•Make use of the background knowledge extracted from an ontology for incorporating semantics.•An in-depth analysis and accuracy measurement of document classification using multiple deep learning architecture configurations.•A comprehensive comparative evaluation of conventional machine learning and deep learning techniques on document classification.•Proposed deep learning techniques outperform the conventional ones, achieving better accuracy in every point of testing.

论文关键词:Document representation,Document classification,Deep learning,Ontology,Machine learning

论文评审过程:Received 18 September 2018, Revised 2 May 2019, Accepted 5 May 2019, Available online 15 May 2019, Version of Record 15 May 2019.

论文官网地址:https://doi.org/10.1016/j.ipm.2019.05.003