Abstractive headline generation using WIDL-expressions

Authors:

Abstract

We present a new paradigm for the automatic creation of document headlines that is based on direct transformation of relevant textual information into well-formed textual output. Starting from an input document, we automatically create compact representations of weighted finite sets of strings, called WIDL-expressions, which encode the most important topics in the document. A generic natural language generation engine performs the headline generation task, driven both by statistical knowledge encapsulated in WIDL-expressions (representing topic biases induced by the input document) and by statistical knowledge encapsulated in language models (representing biases induced by the target language). Our evaluation shows quality comparable to that of a state-of-the-art extractive approach to headline generation, and significant improvements in quality over previously proposed solutions to abstractive headline generation.
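To make the idea of combining the two knowledge sources concrete, here is a minimal toy sketch in Python. It is not the paper's generation engine and does not implement the WIDL formalism itself: it assumes the weighted finite set of strings has already been spelled out explicitly, and it uses a tiny hand-written bigram table in place of a real language model. All names (`CANDIDATES`, `BIGRAM_PROBS`, `UNSEEN`, `lm_logprob`, `best_headline`) and all numbers are hypothetical and chosen only for illustration.

```python
import math

# Hypothetical weighted finite set of strings: each candidate headline
# carries a weight standing in for the topic bias of the input document.
CANDIDATES = {
    "rebels attack the capital": 0.4,
    "the capital attack rebels": 0.4,
    "attack rebels the capital": 0.2,
}

# Hypothetical bigram probabilities standing in for a target-language model.
BIGRAM_PROBS = {
    ("<s>", "rebels"): 0.4,
    ("rebels", "attack"): 0.5,
    ("attack", "the"): 0.4,
    ("the", "capital"): 0.6,
    ("capital", "</s>"): 0.5,
}

# Assumed fallback probability for bigrams missing from the table.
UNSEEN = 1e-3


def lm_logprob(sentence):
    """Sum log bigram probabilities over the sentence, with boundary markers."""
    tokens = ["<s>"] + sentence.split() + ["</s>"]
    return sum(
        math.log(BIGRAM_PROBS.get(pair, UNSEEN))
        for pair in zip(tokens, tokens[1:])
    )


def best_headline(candidates):
    """Pick the string maximizing log(topic weight) + log(LM probability)."""
    return max(
        candidates,
        key=lambda s: math.log(candidates[s]) + lm_logprob(s),
    )


if __name__ == "__main__":
    print(best_headline(CANDIDATES))
```

In this toy setup the first two candidates have the same topic weight, so the language-model score is what separates the fluent ordering from the scrambled one. A real system would search a compact representation rather than enumerate every string.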

Keywords: Natural language generation, Automatic summarization, WIDL-expressions

Review history: Received 19 June 2006, Revised 2 January 2007, Accepted 8 January 2007, Available online 26 March 2007.

Paper URL: https://doi.org/10.1016/j.ipm.2007.01.017