Mining relational data from text: From strictly supervised to weakly supervised learning

作者:

Highlights:

摘要

This paper approaches the relation classification problem in information extraction framework with different machine learning strategies, from strictly supervised to weakly supervised. A number of learning algorithms are presented and empirically evaluated on a standard data set. We show that a supervised SVM classifier using various lexical and syntactic features can achieve competitive classification accuracy. Furthermore, a variety of weakly supervised learning algorithms can be applied to take advantage of large amount of unlabeled data when labeling is expensive. Newly introduced random-subspace-based algorithms demonstrate their empirical advantage over competitors in the context of both active learning and bootstrapping.

论文关键词:Information extraction,Relation classification,Active learning,Bootstrapping,Support vector machines,Random subspace method

论文评审过程:Received 29 October 2006, Revised 4 August 2007, Accepted 13 October 2007, Available online 27 October 2007.

论文官网地址:https://doi.org/10.1016/j.is.2007.10.002