Cut and Paste

作者:

Highlights:

摘要

The paper develops Editor, a language for manipulating semistructured documents, such as those typically available on the Web. Editor programs are based on two simple ideas, taken from text editors: “search” instructions are used to select regions of interest in a document, and “cut & paste” instructions to restructure them. We study the expressive power and the complexity of these programs. We show that they are computationally complete, in the sense that any computable document restructuring can be expressed in Editor. We also study the complexity of a safe subclass of programs, showing that it captures exactly the class of polynomial-time restructurings. The language has been implemented in Java and is currently used in the Araneus project as a basis for a wrapper-generation toolkit.

论文关键词:

论文评审过程:Received 15 December 1997, Revised 30 September 1998, Available online 25 May 2002.

论文官网地址:https://doi.org/10.1006/jcss.1999.1623