Schema profiling of document-oriented databases

作者:

Highlights:

• In document-oriented databases, documents are stored using different schemata.

• The rules that drive the use of schemata are hidden in the application logic.

• We propose schema profiling to capture the rules explaining the use of schemata.

• We express rules in the form of an explicative, precise, and concise decision tree.

• To build this tree we couple value-based and schema-based conditions.

摘要

•In document-oriented databases, documents are stored using different schemata.•The rules that drive the use of schemata are hidden in the application logic.•We propose schema profiling to capture the rules explaining the use of schemata.•We express rules in the form of an explicative, precise, and concise decision tree.•To build this tree we couple value-based and schema-based conditions.

论文关键词:NoSQL,Document-oriented databases,Schema discovery,Decision trees

论文评审过程:Received 22 September 2016, Revised 24 January 2018, Accepted 16 February 2018, Available online 20 February 2018, Version of Record 27 February 2018.

论文官网地址:https://doi.org/10.1016/j.is.2018.02.007