Signature file methods for implementing a ranking strategy

作者:

Highlights:

摘要

In this paper we present two partitioning methods for signature files in order to implement the tf × idf ranking strategy efficiently. The methods represent term frequencies without storing them explicitly. The first method partitions terms in a document based upon their term frequencies. The second one further partitions the terms vertically based upon their ordinal numbers in the dictionary. The latter allows partial retrieval of the signature files in response to a query. A fast weight computation method is also described. Detailed analysis of the new methods is given. Experimental runs are performed on the document collections made available with the SMART system.

论文关键词:

论文评审过程:Received 1 April 1989, Accepted 6 March 1990, Available online 13 July 2002.

论文官网地址:https://doi.org/10.1016/0306-4573(90)90107-D