BECKEY: Understanding, comparing and discovering keys of different semantics in knowledge bases

作者:

Highlights:

摘要

Integrating data coming from different knowledge bases has been one of the most important tasks in the Semantic Web the last years. Keys have been considered to be very useful in the data linking task. A set of properties is considered a key if it uniquely identifies every resource in the data. To cope with the incompleteness of the data, three different key semantics have been proposed so far. We propose BECKEY, a semantic agnostic approach that discovers keys for all three semantics, succeeding to scale on large datasets. Our approach is able to discover keys under the presence of erroneous data or duplicates (i.e., almost keys). A formalisation of the three semantics along with the relations among them is provided. An extended experimental comparison of the three key semantics has taken place. The results allow a better understanding of the three semantics, providing insights on when each semantic is more appropriate for the task of data linking.

论文关键词:Semantic Web,Key discovery,Data linking,Key semantics,RDF,Semantic agnostic approach

论文评审过程:Received 28 June 2019, Revised 10 January 2020, Accepted 28 February 2020, Available online 2 March 2020, Version of Record 4 April 2020.

论文官网地址:https://doi.org/10.1016/j.knosys.2020.105708