Incremental cue phrase learning and bootstrapping method for causality extraction using cue phrase and word pair probabilities

作者:

Highlights:

摘要

This work aims to extract possible causal relations that exist between noun phrases. Some causal relations are manifested by lexical patterns like causal verbs and their sub-categorization. We use lexical patterns as a filter to find causality candidates and we transfer the causality extraction problem to the binary classification. To solve the problem, we introduce probabilities for word pair and concept pair that could be part of causal noun phrase pairs. We also use the cue phrase probability that could be a causality pattern. These probabilities are learned from the raw corpus in an unsupervised manner. With this probabilistic model, we increase both precision and recall. Our causality extraction shows an F-score of 77.37%, which is an improvement of 21.14 percentage points over the baseline model. The long distance causal relation is extracted with the binary tree-styled cue phrase. We propose an incremental cue phrase learning method based on the cue phrase confidence score that was measured after each causal classifier learning step. A better recall of 15.37 percentage points is acquired after the cue phrase learning.

论文关键词:Causality,Pattern learning,Word pair probability,Cue phrase probability,Unsupervised learning

论文评审过程:Received 5 October 2004, Accepted 11 April 2005, Available online 15 June 2005.

论文官网地址:https://doi.org/10.1016/j.ipm.2005.04.004