A Layer-by-Layer Least Squares based Recurrent Networks Training Algorithm: Stalling and Escape

Authors: Siu-yeung Cho, Tommy W.S. Chow

Abstract

Least squares based training algorithms are limited chiefly by the stalling problem and by evaluation error introduced by the transformation matrix, either of which can lead to an unacceptable solution. This paper presents a new training algorithm for recurrent networks, based upon the Layer-by-Layer Least Squares algorithm, that overcomes both problems. In the proposed algorithm, all the weights are evaluated by the least squares method without evaluating the transformation matrix, which speeds up convergence. A probabilistic mechanism, based upon modified weight update equations, is introduced to eliminate the stalling problem experienced by pure least squares type computation. As a result, the proposed algorithm is able to escape from local minima and reach a good solution while still maintaining fast convergence.
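The two ingredients named in the abstract, a least squares solve for the weights and a probabilistic reaction to stalling, can be illustrated with a minimal sketch. This is not the authors' algorithm: the abstract does not give the modified weight update equations, so the function names (`solve_least_squares`, `is_stalled`, `maybe_perturb`), the stall test over a fixed window, and the Gaussian perturbation are all assumptions chosen purely for illustration.

```python
import random


def solve_least_squares(X, t):
    """Least squares weights for a linear layer, via the normal equations.

    Solves (X^T X) w = X^T t with Gaussian elimination and partial pivoting.
    Assumes X^T X is non-singular (illustrative only).
    """
    n = len(X[0])
    A = [[sum(row[i] * row[j] for row in X) for j in range(n)] for i in range(n)]
    b = [sum(row[i] * ti for row, ti in zip(X, t)) for i in range(n)]
    for col in range(n):
        # Partial pivoting: bring the largest remaining entry onto the diagonal.
        pivot = max(range(col, n), key=lambda r: abs(A[r][col]))
        A[col], A[pivot] = A[pivot], A[col]
        b[col], b[pivot] = b[pivot], b[col]
        for r in range(col + 1, n):
            f = A[r][col] / A[col][col]
            for c in range(col, n):
                A[r][c] -= f * A[col][c]
            b[r] -= f * b[col]
    # Back substitution on the upper-triangular system.
    w = [0.0] * n
    for i in reversed(range(n)):
        w[i] = (b[i] - sum(A[i][j] * w[j] for j in range(i + 1, n))) / A[i][i]
    return w


def is_stalled(errors, window=3, tol=1e-6):
    """Assumed stall test: error improved by less than `tol` over `window` epochs."""
    if len(errors) <= window:
        return False
    return errors[-window - 1] - errors[-1] < tol


def maybe_perturb(weights, stalled, rng, prob=0.5, scale=0.1):
    """Assumed escape step: when stalled, add Gaussian noise with probability `prob`.

    Returns the (possibly perturbed) weights and a flag saying whether noise was added.
    """
    if stalled and rng.random() < prob:
        return [w + rng.gauss(0.0, scale) for w in weights], True
    return list(weights), False
```

In a training loop one would, under these assumptions, re-solve each layer's weights by least squares every epoch, record the error, and call `maybe_perturb` whenever `is_stalled` reports no progress, so that the next least squares pass starts from a different point.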

Keywords: convergence stalling, fast convergence speed, Layer-by-Layer Least Squares algorithm, recurrent networks

Paper URL: https://doi.org/10.1023/A:1009672319869