论文标题
非线性神经反馈回路的稳定性使用正方形之和
Stability of Non-linear Neural Feedback Loops using Sum of Squares
论文作者
论文摘要
与传统控制器相比,神经网络控制器有潜力提高反馈系统的性能,因为它们可以充当一般功能近似器。但是,由于神经网络内部激活功能的非线性,量化其安全性和鲁棒性特性已被证明具有挑战性。一个关键的鲁棒性指标是证明反馈系统的稳定性属性,并提供了一个吸引人的区域,该区域已在以前的文献中解决。但是,这些作品仅解决线性系统,或者需要一个人来抽象植物非线性,并使用坡度和部门约束来绑定它们。在本文中,我们使用一系列正方形编程框架来直接使用神经网络控制器来计算非线性系统的稳定性。在此框架内,我们可以提出具有更丰富的结构的高阶候选Lyapunov功能,能够更好地捕获神经网络中非线性系统和非线性的动力学。我们还能够在连续时间分析这些系统,而其他方法则依赖于离散系统。这些高阶lyapunov函数与绑定神经网络输入输出属性的不平等和平等约束的高阶乘数结合使用。与其他方法相比,计算出的吸引力区域的体积增加了,从而可以更好地确保系统稳定性。我们还能够轻松地分析非线性多项式系统,这与其他方法无法做到。我们还能够对参数不确定性进行鲁棒性分析。我们使用数值示例显示了我们方法的好处。
Neural network controllers have the potential to improve the performance of feedback systems compared to traditional controllers, due to their ability to act as general function approximators. However, quantifying their safety and robustness properties has proven challenging due to the non-linearities of the activation functions inside the neural network. A key robustness indicator is certifying the stability properties of the feedback system and providing a region of attraction, which has been addressed in previous literature. However, these works only address linear systems or require one to abstract the plant non-linearities and bound them using slope and sector constraints. In this paper we use a Sum of Squares programming framework to compute the stability of non-linear systems with neural network controllers directly. Within this framework, we can propose higher order candidate Lyapunov functions with richer structures that are able to better capture the dynamics of the non-linear system and the nonlinearities in the neural network. We are also able to analyse these systems in continuous time, whereas other methods rely on discretising the system. These higher order Lyapunov functions are used in conjunction with higher order multipliers on the inequality and equality constraints that bound the neural network input-output properties. The volume of the region of attraction computed is increased compared to other methods, allowing for better safety guarantees on the stability of the system. We are also able to easily analyse non-linear polynomial systems, which is not possible to do with other methods. We are also able to conduct robustness analysis on the parameter uncertainty. We show the benefits of our method using numerical examples.