Exploiting semantics for XML keyword search

作者:

Highlights:

摘要

XML keyword search has attracted a lot of interests with typical search based on lowest common ancestor (LCA). However, in this paper, we show several problems of the LCA-based approaches, including meaningless answers, incomplete answers, duplicated answers, missing answers, and schema-dependent answers. To handle these problems, we exploit the semantics of object, object identifier, relationship, and attribute (referred to as the ORA-semantics). Based on the ORA-semantics, we introduce new ways of labeling and matching. More importantly, we propose a new semantics, called CR (Common Relative) for XML keyword search, which can return answers independent from schema designs. To find answers based on the CR semantics, we discover properties of common relative and propose an efficient algorithms. Experimental results show the seriousness of the problems of the LCA-based approaches. They also show that the CR semantics possesses the properties of completeness, soundness and independence while the response time of our approach is faster than the LCA-based approaches thanks to our techniques.

论文关键词:XML,Keyword search,Object,LCA,Independence,Semantics

论文评审过程:Received 23 February 2015, Accepted 9 June 2015, Available online 18 June 2015, Version of Record 27 September 2015.

论文官网地址:https://doi.org/10.1016/j.datak.2015.06.003