Term co-occurrence in cited/citing journal articles as a measure of document similarity

作者:

Highlights:

摘要

Term co-occurrences were measured in pairs of cited/citing research articles selected over the period of time from 1971 until 1983 from a core literature in the field of information science. A consistent pattern of term similarity was observed in these article pairs. In contrast, document similarity was extremely low in randomly paired articles selected from the same core data base. In 77% of cited/citing articles, there were more co-occurrences of significant terms than there were in 87% of the same articles paired randomly. The study served to quantify terminology-relatedness. A comparison of the similarity of cited/citing literature of various ages resulted in an indication of the amount of new terminology entering the field. And, because a clear delineation was achieved between the similarity of cited/citing articles and the similarity of non-cited/citing articles, the results were extended to define an expected success rate of a matching procedure in one context of information retrieval.

论文关键词:

论文评审过程:Received 18 August 1986, Revised 7 November 1986, Available online 13 July 2002.

论文官网地址:https://doi.org/10.1016/0306-4573(87)90003-3