Quantifying the impact of concept recognition on biomedical information retrieval

作者：

Highlights：

•

摘要

In ad hoc querying of document collections, current approaches to ranking primarily rely on identifying the documents that contain the query terms. Methods such as query expansion, based on thesaural information or automatic feedback, are used to add further terms, and can yield significant though usually small gains in effectiveness. Another approach to adding terms, which we investigate in this paper, is to use natural language technology to annotate – and thus disambiguate – key terms by the concept they represent. Using biomedical research documents, we quantify the potential benefits of tagging users’ targeted concepts in queries and documents in domain-specific information retrieval. Our experiments, based on the TREC Genomics track data, both on passage and full-text retrieval, found no evidence that automatic concept recognition in general is of significant value for this task. Moreover, the issues raised by these results suggest that it is difficult for such disambiguation to be effective.

论文关键词：Biomedical information retrieval,Keywords search,Named-entity recognition

论文评审过程：Received 1 February 2010, Revised 6 December 2010, Accepted 22 February 2011, Available online 31 March 2011.

论文官网地址：https://doi.org/10.1016/j.ipm.2011.02.009