DropCircuit : A Modular Regularizer for Parallel Circuit Networks

作者:Kien Tuong Phan, Tomas Henrique Maul, Tuong Thuy Vu, Weng Kin Lai

摘要

How to design and train increasingly large neural network models is a topic that has been actively researched for several years. However, while there exists a large number of studies on training deeper and/or wider models, there is relatively little systematic research particularly on the effective usage of wide modular neural networks. Addressing this gap, and in an attempt to solve the problem of lengthy training times, we proposed Parallel Circuits (PCs), a biologically inspired architecture based on the design of the retina. In previous work we showed that this approach fails to maintain generalization performance in spite of achieving sharp speed gains. To address this issue, and motivated by the way dropout prevents node co-adaptation, in this paper, we suggest an improvement by extending dropout to the parallel-circuit architecture. The paper provides empirical proof and multiple insights into this combination. Experiments show promising results in which improved error rates are achieved in most cases, whilst maintaining the speed advantage of the PC approach.

论文关键词:Parallel circuits, Deep learning, Dropout, DropCircuit

论文评审过程:

论文官网地址:https://doi.org/10.1007/s11063-017-9677-4