Improving linear classifier for Chinese text categorization

作者:

Highlights:

摘要

The goal of this paper is to derive extra representatives from each class to compensate for the potential weakness of linear classifiers that compute one representative for each class. To evaluate the effectiveness of our approach, we compared with linear classifier produced by Rocchio algorithm and the k-nearest neighbor (kNN) classifier. Experimental results show that our approach improved linear classifier and achieved micro-averaged accuracy close to that of kNN, with much less classification time. Furthermore, we could provide a suggestion to reorganize the structure of classes when identify new representatives for linear classifier.

论文关键词:Information retrieval,Linear classifier,Text categorization

论文评审过程:Received 7 September 2001, Accepted 25 September 2002, Available online 17 December 2003.

论文官网地址:https://doi.org/10.1016/S0306-4573(02)00089-4