Exploiting edge semantics in citation graphs using efficient, vertical ARM

作者:Imad Rahal, Dongmei Ren, Weihua Wu, Anne Denton, Christopher Besemann, William Perrizo

摘要

Graphs are increasingly becoming a vital source of information within which a great deal of semantics is embedded. As the size of available graphs increases, our ability to arrive at the embedded semantics grows into a much more complicated task. One form of important hidden semantics is that which is embedded in the edges of directed graphs. Citation graphs serve as a good example in this context. This paper attempts to understand temporal aspects in publication trends through citation graphs, by identifying patterns in the subject matters of scientific publications using an efficient, vertical association rule mining model. Such patterns can (a) indicate subject-matter evolutionary history, (b) highlight subject-matter future extensions, and (c) give insights on the potential effects of current research on future research. We highlight our major differences with previous work in the areas of graph mining, citation mining, and Web-structure mining, propose an efficient vertical data representation model, introduce a new subjective interestingness measure for evaluating patterns with a special focus on those patterns that signify strong associations between properties of cited papers and citing papers, and present an efficient algorithm for the purpose of discovering rules of interest followed by a detailed experimental analysis.

论文关键词:Citation analysis, Citation graphs, Association rule mining, Frequent itemset mining, Data mining, Graph databases, Link analysis, P-trees

论文评审过程:

论文官网地址:https://doi.org/10.1007/s10115-005-0229-2