Taxonomy induction based on a collaboratively built knowledge repository

作者:

Highlights:

摘要

The category system in Wikipedia can be taken as a conceptual network. We label the semantic relations between categories using methods based on connectivity in the network and lexico-syntactic matching. The result is a large scale taxonomy. For evaluation we propose a method which (1) manually determines the quality of our taxonomy, and (2) automatically compares its coverage with ResearchCyc, one of the largest manually created ontologies, and the lexical database WordNet. Additionally, we perform an extrinsic evaluation by computing semantic similarity between words in benchmarking datasets. The results show that the taxonomy compares favorably in quality and coverage with broad-coverage manually created resources.

论文关键词:Natural language processing,Knowledge acquisition,Lexical semantics

论文评审过程:Received 23 August 2010, Revised 6 January 2011, Accepted 10 January 2011, Available online 12 January 2011.

论文官网地址:https://doi.org/10.1016/j.artint.2011.01.003