Using a new relational concept to improve the clustering performance of search engines

作者:

Highlights:

摘要

In this paper, we present a novel clustering algorithm to generate a number of candidate clusters from other web search results. The candidate clusters generate a connective relation among the clusters and the relation is semantic. Moreover, the algorithm also contains the following attractive properties: (1) it can be applied to multilingual web documents, (2) it improves the clustering performance of any search engine, (3) its unsupervised learning can automatically identify potentially relevant knowledge without using any corpus, and (4) clustering results are generated on the fly and fitted into search engines.

论文关键词:Document clustering,Semantic relation,Relational concept,Web search engines,Web documents

论文评审过程:Received 8 April 2008, Revised 17 November 2008, Accepted 14 April 2010, Available online 7 May 2010.

论文官网地址:https://doi.org/10.1016/j.ipm.2010.04.001