Selecting optimal instantiations of data models—Theory and validation of an ex ante approach

作者:

Highlights:

摘要

The schema of an information system can significantly impact the ability of end users to efficiently and effectively retrieve the information they need. Obtaining quickly the appropriate data increases the likelihood that an organization will make good decisions and respond adeptly to challenges. This research presents and validates a methodology for evaluating, ex ante, the relative desirability of alternative instantiations of a model of data. In contrast to prior research, each instantiation is based on a different formal theory. This research theorizes that the instantiation that yields the lowest weighted average query complexity for a representative sample of information requests is the most desirable instantiation for end-user queries. The theory was validated by an experiment that compared end-user performance using an instantiation of a data structure based on the relational model of data with performance using the corresponding instantiation of the data structure based on the object-relational model of data. Complexity was measured using three different Halstead metrics: program length, difficulty, and effort. For a representative sample of queries, the average complexity using each instantiation was calculated. As theorized, end users querying the instantiation with the lower average complexity made fewer semantic errors, i.e., were more effective at composing queries.

论文关键词:Models of data,Data representations,Object-relational databases,Relational databases,Query languages,Query complexity

论文评审过程:Received 20 July 2004, Revised 7 October 2005, Accepted 13 October 2005, Available online 4 January 2006.

论文官网地址:https://doi.org/10.1016/j.dss.2005.10.002