Discovering entity inheritance relationships in document stores

Authors:

Highlights:

Abstract

While declaring schemas is not required in most NoSQL stores, explicit schemas are crucial for providing capabilities that are essential in the development of database applications, such as schema visualization, code generation, and data quality improvement. Therefore, some published works and data modeling tools have recently presented reverse-engineering approaches to extract NoSQL schemas from data or code. However, none of them has addressed the inference of inheritance relationships, in which a parent entity type is specialized into one or more child entity types. In this paper, we present a strategy to discover entity inheritance relationships from the set of existing structural variations of an entity type. The proposed algorithm also identifies the discriminator field, if it exists. A validation process has been defined and applied to three real datasets.

Keywords: NoSQL systems, Document stores, Schema extraction, Inheritance relationship inference, Polymorphic pattern

Article history: Received 16 March 2021, Revised 28 May 2021, Accepted 11 August 2021, Available online 12 August 2021, Version of Record 20 August 2021.

Official paper URL: https://doi.org/10.1016/j.knosys.2021.107394