Supporting web query expansion efficiently using multi-granularity indexing and query processing

作者:

Highlights:

摘要

The problem of word mismatch in information retrieval (IR) occurs because users often use different words to describe concepts in their queries than authors use to describe the same concepts in their documents. Query expansion is used to deal with the mismatch between author and user vocabularies. To support query expansion, indices on words related by lexical semantics and syntactical co-occurrence need to be maintained. Two issues become paramount in supporting query expansion: the size of index tables and the query processing overhead. In this paper, we propose to use the notion of multi-granularity for more efficient indexing and query processing while the same degrees of precision and recall are maintained. We also describes extensions of this technique to handle: (1) query relaxation to handle words with multiple senses and with other semantic relationships; (2) progressive processing of queries with top N results and (3) progressive processing of queries with specification of the importance of each keyword.

论文关键词:Query expansion,Information retrieval,World-wide web,Indexing,Query processing,Lexical-semantics,Co-occurrence,Progressive processing

论文评审过程:Received 22 February 1999, Revised 21 July 1999, Accepted 22 May 2000, Available online 6 September 2000.

论文官网地址:https://doi.org/10.1016/S0169-023X(00)00024-0