An algorithm for the calculation of exact term discrimination values

作者:

Highlights:

摘要

Term discrimination values have been suggested as an effective means for the selection and weighting of index terms in automatic document retrieval systems. This paper reports an algorithm for the calculation of term discrimination values that is sufficiently fast in operation to permit the use of exact values, rather than the approximate values studied in previous work. Evidence is presented to show that the relationship between term discrimination and term frequency is crucially dependent upon the type of inter-document similarity measure that is used for the calculation of the discrimination values.

论文关键词:

论文评审过程:Received 28 September 1984, Available online 18 July 2002.

论文官网地址:https://doi.org/10.1016/0306-4573(85)90107-4