Schema matching based on position of attribute in query statement

作者:

Highlights:

摘要

Attribute-level schema matching is a critical step in numerous database applications, such as DataSpaces, Ontology Merging and Schema Integration. There exist many researches on this topic, however, they all ignore evidences about the positions of attributes in query statements, which are crucial to find high-quality matches between schema attributes. In this paper, we propose a novel matching technique based on the positions of attributes appearing in the schema structure of query results. The positions of attributes in query results embody the extent of the importance of an attribute for the user browsing the query results. The core idea of our approach is to collect the statistics about attribute positions from query logs to find correspondences between attributes (matches). Our method works in three phases. The first phase is to design a matrix to record the statistics about attribute positions. Then, we employ two scoring functions to measure the similarities between collected statistics of two schemas to be matched. Finally, we employ a traditional algorithm to find the optimal mapping. Furthermore, our approach can be combined with other existing matchers to obtain more accurate matching results. An experimental study shows that our approach is effective and has good performance.

论文关键词:Schema matching,Database integration,Query log,Ant Colony Optimization,Attribute position,Query statement

论文评审过程:Received 21 April 2013, Revised 11 June 2014, Accepted 9 November 2014, Available online 11 December 2014.

论文官网地址:https://doi.org/10.1016/j.knosys.2014.11.005