Incremental learning of context free grammars based on bottom-up parsing and search

作者:

Highlights:

摘要

This paper describes approaches for machine learning of context free grammars (CFGs) from positive and negative sample strings, which are implemented in Synapse system. The grammatical inference consists of a rule generation by “inductive CYK algorithm,” mechanisms for incremental learning, and search. Inductive CYK algorithm generates minimum production rules required for parsing positive samples, when the bottom-up parsing by CYK algorithm does not succeed. The incremental learning is used not only for synthesizing grammars by giving the system positive strings in the order of their length but also for learning grammars from other similar grammars. Synapse can synthesize fundamental ambiguous and unambiguous CFGs including nontrivial grammars such as the set of strings not of the form ww with w∈{a,b}+.

论文关键词:Incremental learning,CYK algorithm,Context free language,Unambiguous grammar,Chomsky normal form,Synapse

论文评审过程:Received 17 March 2004, Available online 5 March 2005.

论文官网地址:https://doi.org/10.1016/j.patcog.2005.01.004