Answering content and structure-based queries on XML documents using relevance propagation

作者:

Highlights:

摘要

As XML documents contain both content and structure information, taking advantage of the document structure in the retrieval process can lead to better identify relevant information units. In this paper, we describe an information retrieval (IR) approach dealing with queries composed of content and structure conditions. The XFIRM model we propose is designed to be as flexible as possible to process such queries. It is based on a complete query language, derived from XPath and on a relevance values propagation method. This paper aims at evaluating functions used in the propagation process, and particularly the use of distance between nodes as a parameter. The proposed method is evaluated, thanks to the INEX evaluation initiative. Results show a relative high precision of our proposal.

论文关键词:XML,Information retrieval,Relevance propagation method,Content and structure queries

论文评审过程:Available online 7 December 2005.

论文官网地址:https://doi.org/10.1016/j.is.2005.11.007