Automatic induction of rules for text simplification

Authors:

Highlights:

Abstract

Long and complicated sentences pose problems for many state-of-the-art natural language technologies. We have been exploring methods to automatically transform such sentences into simpler ones. These methods use a rule-based system driven by the syntax of the text in the domain of interest. Hand-crafting rules for every domain is time-consuming and impractical. This paper describes an algorithm, and an implementation of it, by which generalized simplification rules are automatically induced from annotated training material using a novel partial parsing technique that combines constituent structure and dependency information. The algorithm employs example-based generalizations on linguistically motivated structures.

Keywords: Text-simplification, Supertags, Dependency

Article history: Received 15 May 1997, Accepted 29 May 1997, Available online 19 May 1998.

Paper URL: https://doi.org/10.1016/S0950-7051(97)00029-4