Learning context-free grammars using tabular representations

Authors:

Abstract

We present a novel algorithm that uses a new hypothesis representation for learning context-free grammars from a finite set of positive and negative examples. The representation is a table-like data structure similar to the parse table used by efficient parsing algorithms for context-free grammars, such as the Cocke–Younger–Kasami (CYK) algorithm. With this representation, the problem of learning context-free grammars from examples reduces to the problem of partitioning the set of nonterminals, which we solve using genetic algorithms. Further, we incorporate partially structured examples to improve the efficiency of the learning algorithm; a structured example is a string with some parentheses inserted to indicate the shape of the derivation tree under the unknown grammar. We report experimental results for these algorithms and theoretically analyse the completeness of the search space of the tabular method for context-free grammars.
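To make the reduction to partitioning concrete, the following is a minimal Python sketch, not the paper's actual construction. It assumes a grammar in Chomsky normal form, stored as two sets of tuples, and uses a standard CYK table to test membership. The names `cyk_accepts`, `merge_nonterminals`, and `block_of`, as well as the toy grammar, are hypothetical and chosen only for illustration. The partition is fixed by hand here; in the paper it is searched for by a genetic algorithm, which this sketch does not attempt to reproduce.

```python
def cyk_accepts(unary, binary, start, word):
    """CYK membership test for a grammar in Chomsky normal form.

    unary  : set of (A, t) tuples for rules A -> t (t is a terminal)
    binary : set of (A, B, C) tuples for rules A -> B C
    """
    n = len(word)
    if n == 0:
        return False
    # table[(i, j)] holds the nonterminals that derive word[i:j]
    table = {}
    for i, ch in enumerate(word):
        table[(i, i + 1)] = {a for (a, t) in unary if t == ch}
    for length in range(2, n + 1):
        for i in range(n - length + 1):
            j = i + length
            cell = set()
            for k in range(i + 1, j):
                left, right = table[(i, k)], table[(k, j)]
                cell |= {a for (a, b, c) in binary if b in left and c in right}
            table[(i, j)] = cell
    return start in table[(0, n)]


def merge_nonterminals(unary, binary, start, block_of):
    """Rename every nonterminal to the name of its partition block."""
    def rename(x):
        return block_of.get(x, x)
    new_unary = {(rename(a), t) for (a, t) in unary}
    new_binary = {(rename(a), rename(b), rename(c)) for (a, b, c) in binary}
    return new_unary, new_binary, rename(start)


# Hypothetical over-specific grammar: every nonterminal is distinct,
# so with start symbol S2 it accepts "aabb" but does not generalize.
unary = {("A1", "a"), ("B1", "b"), ("A2", "a"),
         ("B2", "b"), ("A3", "a"), ("B3", "b")}
binary = {("S1", "A1", "B1"), ("S2", "A2", "T2"),
          ("T2", "S3", "B2"), ("S3", "A3", "B3")}

# A hand-picked partition of the nonterminals (assumption: found by search).
block_of = {"S1": "S", "S2": "S", "S3": "S",
            "A1": "A", "A2": "A", "A3": "A",
            "B1": "B", "B2": "B", "B3": "B",
            "T2": "T"}

merged_unary, merged_binary, merged_start = merge_nonterminals(
    unary, binary, "S2", block_of)

print(cyk_accepts(unary, binary, "S2", "aaabbb"))
print(cyk_accepts(merged_unary, merged_binary, merged_start, "aaabbb"))
print(cyk_accepts(merged_unary, merged_binary, merged_start, "abab"))
```

Merging the blocks yields the rules S -> A B | A T, T -> S B, A -> a, B -> b, so the merged grammar generalizes from the seen strings to longer strings of the form a^n b^n while still rejecting a string such as "abab". A learner in this setting would score each candidate partition by whether the merged grammar accepts all positive examples and rejects all negative ones.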

Keywords: Grammatical inference, Context-free grammar, Genetic algorithm, Structured example, Dynamic programming

Review history: Received 17 March 2004, Accepted 17 March 2004, Available online 3 March 2005.

Paper URL: https://doi.org/10.1016/j.patcog.2004.03.021