Refining non-taxonomic relation labels with external structured data to support ontology learning

作者:

Highlights:

摘要

This paper presents a method to integrate external knowledge sources such as DBpedia and OpenCyc into an ontology learning system that automatically suggests labels for unknown relations in domain ontologies based on large corpora of unstructured text. The method extracts and aggregates verb vectors from semantic relations identified in the corpus. It composes a knowledge base which consists of (i) verb centroids for known relations between domain concepts, (ii) mappings between concept pairs and the types of known relations, and (iii) ontological knowledge retrieved from external sources. Applying semantic inference and validation to this knowledge base improves the quality of suggested relation labels. A formal evaluation compares the accuracy and average ranking precision of this hybrid method with the performance of methods that solely rely on corpus data and those that are only based on reasoning and external data sources.

论文关键词:Ontologies,Semantic web,Ontology learning,Relation labeling,Machine learning

论文评审过程:Received 13 July 2009, Revised 19 February 2010, Accepted 23 February 2010, Available online 2 March 2010.

论文官网地址:https://doi.org/10.1016/j.datak.2010.02.010