Using ontology to improve precision of terminology extraction from documents

Abstract

In this paper, we propose a new approach that uses an ontology to improve the precision of terminology extraction from documents. First, a linguistic method extracts terminological patterns from the documents. Then, similarity measures defined within the ontology are used to rank the semantic dependency of the nouns in each pattern. Finally, a predefined proportion of the patterns, selected according to their semantic dependencies, is retained and regarded as terminology. Experiments on the Reuters-21578 corpus show that the WordNet ontology, which we adopted for extracting terminology from English documents, significantly improves the precision of the classical linguistic method for terminology extraction.
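The rank-and-retain steps described above can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: the abstract does not say which similarity measure or aggregation is used, so the sketch assumes a path-based similarity (1 over 1 plus the number of edges between two nouns) averaged over all noun pairs in a pattern. The `PARENT` taxonomy is a hypothetical toy stand-in for WordNet, and the function names and the 0.5 proportion are likewise assumptions.

```python
from itertools import combinations
from math import ceil

# Hypothetical is-a taxonomy (child -> parent); a toy stand-in for WordNet.
PARENT = {
    "entity": None,
    "possession": "entity",
    "money": "possession",
    "currency": "money",
    "interest": "money",
    "time": "entity",
    "week": "time",
    "organism": "entity",
    "dog": "organism",
}


def ancestors(word):
    """Return the chain from word up to the root, starting with word itself."""
    chain = []
    while word is not None:
        chain.append(word)
        word = PARENT.get(word)
    return chain


def path_similarity(a, b):
    """Assumed measure: 1 / (1 + edges on the shortest path between a and b)."""
    steps_from_b = {w: i for i, w in enumerate(ancestors(b))}
    for steps_from_a, w in enumerate(ancestors(a)):
        if w in steps_from_b:
            return 1.0 / (1 + steps_from_a + steps_from_b[w])
    return 0.0  # no shared ancestor (e.g. a word missing from the taxonomy)


def semantic_dependency(nouns):
    """Assumed aggregation: mean pairwise similarity of the nouns in a pattern."""
    pairs = list(combinations(nouns, 2))
    if not pairs:
        return 0.0
    return sum(path_similarity(a, b) for a, b in pairs) / len(pairs)


def retain_top(patterns, proportion):
    """Rank patterns by semantic dependency and keep the top proportion."""
    ranked = sorted(patterns, key=semantic_dependency, reverse=True)
    return ranked[:ceil(len(ranked) * proportion)]


# Candidate patterns, each reduced to its nouns (illustrative input only).
candidates = [("dog", "currency"), ("currency", "interest"), ("money", "week")]
terms = retain_top(candidates, 0.5)
```

In this toy setup, a pattern whose nouns sit close together in the taxonomy (such as "currency" and "interest") ranks above one whose nouns only meet at the root, so it is more likely to survive the cut. A real run would replace the toy taxonomy with WordNet lookups and would first need the linguistic pattern extraction step, which is not shown here.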

Keywords: Terminology extraction, Ontology, Semantic dependency, WordNet

Review history: Received 24 May 2008, Accepted 7 December 2008, Available online 24 December 2008.

Paper URL: https://doi.org/10.1016/j.eswa.2008.12.034