LABRADOR: Efficiently publishing relational databases on the web by using keyword-based query interfaces

作者:

Highlights:

摘要

A vast amount of valuable information, produced and consumed by people and institutions, is currently stored in relational databases. For many purposes, there is an ever increasing demand for having these databases published on the Web, so that users can query the data available in them. An important requirement for this to happen is that query interfaces must be as simple and intuitive as possible. In this paper we present LABRADOR, a system for efficiently publishing relational databases on the Web by using a simple text box query interface. The system operates by taking an unstructured keyword-based query posed by a user and automatically deriving an equivalent SQL query that fits the user’s information needs, as expressed by the original query. The SQL query is then sent to a DBMS and its results are processed by LABRADOR to create a relevance-based ranking of the answers. Experiments we present show that LABRADOR can automatically find the most suitable SQL query in more than 75% of the cases, and that the overhead introduced by the system in the overall query processing time is almost insignificant. Furthermore, the system operates in a non-intrusive way, since it requires no modifications to the target database schema.

论文关键词:Keyword-based queries,Web databases,Bayesian networks

论文评审过程:Received 26 April 2006, Revised 21 September 2006, Accepted 22 September 2006, Available online 5 December 2006.

论文官网地址:https://doi.org/10.1016/j.ipm.2006.09.018