Cerno: Light-weight tool support for semantic annotation of textual documents

作者:

Highlights:

摘要

Enrichment of text documents with semantic metadata reflecting their meaning facilitates document organization, indexing and retrieval. However, most web data remain unstructured because of the difficulty and the cost of manually annotating text. In this work, we present Cerno, a framework for semi-automatic semantic annotation of textual documents according to a domain-specific semantic model. The proposed framework is founded on light-weight techniques and tools intended for legacy code analysis and markup. To illustrate the feasibility of our proposal, we report experimental results of its application to two different domains. These results suggest that light-weight semi-automatic techniques for semantic annotation are feasible, require limited human effort for adaptation to a new domain, and demonstrate markup quality comparable with state-of-the-art methods.

论文关键词:8.4 Semantic web,23 Natural language for DKE,20.1 Re- and reverse engineering,Semantic annotation

论文评审过程:Received 5 February 2009, Revised 16 July 2009, Accepted 16 July 2009, Available online 23 July 2009.

论文官网地址:https://doi.org/10.1016/j.datak.2009.07.012