论文标题
RNN学习过程中随机性与结构之间的相互作用
The interplay between randomness and structure during learning in RNNs
论文作者
论文摘要
经过低维任务训练的经过培训的复发性神经网络(RNN)已被广泛用于建模功能生物网络。但是,通过学习发现的解决方案和初始连通性的影响尚不清楚。在这里,我们检查了使用梯度下降对受神经科学文献启发的不同任务进行训练的RNN。我们发现,尽管学习算法的性质不受限制,但可以通过低级矩阵来描述复发性连通性的变化。为了确定低级结构的起源,我们转向可分析的设置:在简化任务上训练线性RNN。我们展示了低维度的任务结构如何导致连通性的低级别变化。这种低级结构使我们能够在存在随机初始连通性的情况下解释和量化加速学习的现象。总的来说,我们的研究开辟了一个新的观点,可以从学习过程和由此产生的网络结构方面了解经过训练的RNN。
Recurrent neural networks (RNNs) trained on low-dimensional tasks have been widely used to model functional biological networks. However, the solutions found by learning and the effect of initial connectivity are not well understood. Here, we examine RNNs trained using gradient descent on different tasks inspired by the neuroscience literature. We find that the changes in recurrent connectivity can be described by low-rank matrices, despite the unconstrained nature of the learning algorithm. To identify the origin of the low-rank structure, we turn to an analytically tractable setting: training a linear RNN on a simplified task. We show how the low-dimensional task structure leads to low-rank changes to connectivity. This low-rank structure allows us to explain and quantify the phenomenon of accelerated learning in the presence of random initial connectivity. Altogether, our study opens a new perspective to understanding trained RNNs in terms of both the learning process and the resulting network structure.