Node similarity in the citation graph

Authors: Wangzhong Lu, J. Janssen, E. Milios, N. Japkowicz, Yongzheng Zhang

Abstract

Published scientific articles are linked together into a graph, the citation graph, through their citations. This paper explores the notion of similarity based on connectivity alone, and proposes several algorithms to quantify it. Our metrics take advantage of the local neighborhoods of the nodes in the citation graph. Two variants of link-based similarity estimation between two nodes are described, one based on the separate local neighborhoods of the nodes, and another based on the joint local neighborhood expanded from both nodes at the same time. The algorithms are implemented and evaluated on a subgraph of the citation graph of computer science in a retrieval context. The results are compared with text-based similarity, and demonstrate the complementarity of link-based and text-based retrieval.
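As a rough illustration of the first variant (similarity from the separate local neighborhoods of two nodes), the sketch below collects each node's neighborhood by breadth-first search and scores the overlap. The Jaccard overlap, the undirected treatment of citations, the `radius` parameter, and all function names are assumptions made for this example; the abstract does not specify the paper's actual metrics, and the joint-neighborhood variant is not attempted here.

```python
from collections import deque


def undirected(edges):
    """Build an adjacency map from (citing, cited) pairs, ignoring direction (an assumption)."""
    graph = {}
    for citing, cited in edges:
        graph.setdefault(citing, set()).add(cited)
        graph.setdefault(cited, set()).add(citing)
    return graph


def neighborhood(graph, start, radius):
    """Return every node within `radius` hops of `start`, including `start` itself."""
    seen = {start}
    frontier = deque([(start, 0)])
    while frontier:
        node, dist = frontier.popleft()
        if dist == radius:
            continue  # do not expand beyond the requested radius
        for nxt in graph.get(node, ()):
            if nxt not in seen:
                seen.add(nxt)
                frontier.append((nxt, dist + 1))
    return seen


def separate_neighborhood_similarity(graph, a, b, radius=1):
    """Jaccard overlap of the two local neighborhoods, with a and b themselves excluded.

    Hypothetical stand-in metric, not the one defined in the paper.
    """
    na = neighborhood(graph, a, radius) - {a, b}
    nb = neighborhood(graph, b, radius) - {a, b}
    union = na | nb
    return len(na & nb) / len(union) if union else 0.0
```

With this stand-in metric, two papers that cite largely the same references score close to 1 and papers with disjoint neighborhoods score 0. Increasing `radius` widens the neighborhood considered around each node.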

Keywords: Networked information spaces, Document similarity metric, Citation graph, Digital libraries

Paper URL: https://doi.org/10.1007/s10115-006-0023-9