论文标题
神经网络动态的计算机验证:第一个案例研究
Computer Validation of Neural Network Dynamics: A First Case Study
论文作者
论文摘要
当前的大量机器学习方法依赖于深层神经网络。然而,将神经网络视为非线性动力学系统,很快就会很快地,在数学上严格建立网络中的节点产生的某些模式非常困难。的确,它在复杂系统的非线性动力学中得到了很好的理解,即使在低维模型中,植根于铅笔和纸的方法也经常达到其极限。在这项工作中,我们通过验证的非线性动力学数值方法的范式提出了完全不同的观点。这个想法是使用计算机辅助的证据来数学上验证神经网络中非线性模式的存在。作为一个案例研究,我们考虑了一类经常性的神经网络,在该网络中,我们通过计算机援助证明了数百个霍普夫分叉点,它们的非分类元,因此也存在数百个周期性轨道。我们的范式有能力严格验证神经网络的复杂非线性行为,这提供了通过计算机辅助的证据来解释机器学习方法的全部能力以及潜在敏感性的第一步。我们展示了经过验证的数值技术如何阐明复发性神经网络(RNN)的内部工作。为此,通过证明在各种环境中存在周期性轨道的存在,HOPF分叉的证明是迈向RNN实际应用中动态系统理论迈出的第一步。
A large number of current machine learning methods rely upon deep neural networks. Yet, viewing neural networks as nonlinear dynamical systems, it becomes quickly apparent that mathematically rigorously establishing certain patterns generated by the nodes in the network is extremely difficult. Indeed, it is well-understood in the nonlinear dynamics of complex systems that, even in low-dimensional models, analytical techniques rooted in pencil-and-paper approaches frequently reach their limits. In this work, we propose a completely different perspective via the paradigm of validated numerical methods of nonlinear dynamics. The idea is to use computer-assisted proofs to validate mathematically the existence of nonlinear patterns in neural networks. As a case study, we consider a class of recurrent neural networks, where we prove via computer assistance the existence of several hundred Hopf bifurcation points, their non-degeneracy, and hence also the existence of several hundred periodic orbits. Our paradigm has the capability to rigorously verify complex nonlinear behaviour of neural networks, which provides a first step to explain the full abilities, as well as potential sensitivities, of machine learning methods via computer-assisted proofs. We showcase how validated numerical techniques can shed light on the internal working of recurrent neural networks (RNNs). For this, proofs of Hopf bifurcations are a first step towards an integration of dynamical system theory in practical application of RNNs, by proving the existence of periodic orbits in a variety of settings.