A learning scheme for information retrieval in hypertext

作者:

Highlights:

摘要

In proposing a searching strategy well suited to the hypertext environment, we have considered four criteria: (1) the retrieval scheme should be integrated into a large hypertext environment; (2) the retrieval process should be operable with an unrestricted text collection; (3) the processing time should be reasonable; and (4) the system should be capable of learning in order to improve its retrieval effectiveness.To satisfy these four criteria, we have designed and implemented a search strategy for hypertext systems based on an extended Boolean model (the p-norm scheme) and supplemented it with links to improve the ranking of the retrieved items in a sequence most likely to fulfill the intent of the user. These links, representing additional information about document content, are established according to the requests and relevance judgments. Using a fully automatic procedure, our retrieval scheme can be applied to most existing systems. Based on the CACM test collection, which includes 3,204 documents and the CISI corpus (1,460 documents), we have built a hypertext and evaluated our proposed strategy. The retrieval effectiveness of our solution presents encouraging results.

论文关键词:Hypertext,Information retrieval,Learning scheme,Hypertext link semantics,p-norm model,Probabilistic retrieval model

论文评审过程:Received 12 November 1992, Accepted 13 September 1993, Available online 19 July 2002.

论文官网地址:https://doi.org/10.1016/0306-4573(94)90037-X