Automated conversion from natural language query to SPARQL query

Authors: Haemin Jung, Wooju Kim

Abstract

Domain ontologies are now widely used as background knowledge bases. However, end users of ontology-based question answering (QA) systems are generally unaware of the major concepts of an ontology or the structure of its schema, so an efficient method is needed to bridge this gap. The critical issue for ontology-based QA systems is how to generate a SPARQL query from a user’s natural language query (NLQ). We therefore propose a method to generate SPARQL queries from Korean natural language queries. When an input query arrives, we split it into a set of tokens and map each token to resources in the ontology. A graph generation process then creates multiple “query graphs” by arranging the resources and identifying the relationships between them. To identify these relationships, we apply a path search algorithm based on the structure of the domain ontology schema. We score each query graph by the degree to which it reflects a typical user’s intent, and the best-rated query graph is converted into a SPARQL query. We implemented a prototype system to evaluate the proposed method on a music domain ontology and conclude that our conversion process can turn Korean natural language queries into semantically equivalent SPARQL queries. We anticipate that, after appropriate modification, the process can be applied to other languages.
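
The path search and query conversion steps described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the toy schema, the class and property names, the `ex:` prefix and its namespace, and the helper names `find_path` and `to_sparql` are all assumptions made for illustration, and plain breadth-first search stands in for whatever path search algorithm the paper uses. Token mapping and query graph scoring are omitted.

```python
from collections import deque

# Hypothetical toy schema for a music ontology: (domain class, property, range class).
SCHEMA = [
    ("Song", "performedBy", "Artist"),
    ("Song", "includedIn", "Album"),
    ("Album", "releasedBy", "Label"),
]


def find_path(start, goal):
    """Breadth-first search over the schema, following properties in either direction.

    Returns the list of schema triples connecting start to goal, or None.
    """
    queue = deque([(start, [])])
    seen = {start}
    while queue:
        node, path = queue.popleft()
        if node == goal:
            return path
        for dom, prop, rng in SCHEMA:
            if dom == node and rng not in seen:
                nxt = rng
            elif rng == node and dom not in seen:
                nxt = dom
            else:
                continue
            seen.add(nxt)
            queue.append((nxt, path + [(dom, prop, rng)]))
    return None


def to_sparql(target, path):
    """Turn a schema path into a SELECT query, using one variable per class."""
    def var(cls):
        return "?" + cls.lower()

    patterns = [f"  {var(d)} ex:{p} {var(r)} ." for d, p, r in path]
    return (
        "PREFIX ex: <http://example.org/music#>\n"  # assumed namespace
        f"SELECT {var(target)} WHERE {{\n" + "\n".join(patterns) + "\n}"
    )


path = find_path("Artist", "Label")
query = to_sparql("Artist", path)
print(query)
```

In this sketch the two mapped resources (`Artist` and `Label`) are connected through `Song` and `Album`, and each property on the path becomes one triple pattern. A fuller system would generate several candidate graphs this way and rank them before conversion.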

Keywords: SPARQL generation, Domain ontology, Natural language query, Korean language

Paper URL: https://doi.org/10.1007/s10844-019-00589-2