Automatic extraction of shapes using sheXer

作者:

Highlights:

摘要

There is an increasing number of projects based on Knowledge Graphs and SPARQL endpoints. These SPARQL endpoints are later queried by final users or used to feed many different kinds of applications. Shape languages, such as ShEx and SHACL, have emerged to guide the evolution of these graphs and to validate their expected topology. However, authoring shapes for an existing knowledge graph is a time-consuming task. The task gets more challenging when dealing with sources, possibly maintained by heterogeneous agents. In this paper, we present sheXer, a system that extracts shapes by mining the graph structure. We offer sheXer as a free Python library capable of producing both ShEx and SHACL content. Compared to other automatic shape extractors, sheXer includes some novel features such as shape inter-linkage and computation of big real-world datasets. We analyze the features and limitations w.r.t. performance with different experiments using the English chapter of DBpedia.

论文关键词:Knowledge Graph,RDF,ShEx,SHACL,Automatic extraction

论文评审过程:Received 19 May 2021, Revised 18 October 2021, Accepted 13 December 2021, Available online 17 December 2021, Version of Record 31 December 2021.

论文官网地址:https://doi.org/10.1016/j.knosys.2021.107975