LSTM\(^{2}\): Multi-Label Ranking for Document Classification

作者:Yan Yan, Ying Wang, Wen-Chao Gao, Bo-Wen Zhang, Chun Yang, Xu-Cheng Yin

摘要

Multi-label document classification is a typical challenge in many real-world applications. Multi-label ranking is a common approach, while existing studies usually disregard the effects of context and the relationships among labels during the scoring process. In this paper, we propose an Long Short Term Memory (LSTM)-based multi-label ranking model for document classification, namely LSTM\(^2\) consisting of repLSTM—an adaptive data representation process and rankLSTM—a unified learning-ranking process. In repLSTM, the supervised LSTM is used to learn document representation by incorporating the document labels. In rankLSTM, the order of the documents labels is rearranged in accordance with a semantic tree, in which the semantics are compatible with and appropriate to the sequential learning of LSTM. The model can be wholly trained by sequentially predicting labels. Connectionist Temporal Classification is performed in rankLSTM to address the error propagation for a variable number of labels in each document. Moreover, a variety of experiments with document classification conducted on three typical datasets reveal the impressive performance of our proposed approach.

论文关键词:LSTM\(^{2}\) (repLSTM, rankLSTM), Multi-label ranking, Document classification, Deep learning, Semantic indexing

论文评审过程:

论文官网地址:https://doi.org/10.1007/s11063-017-9636-0