论文标题
在数据库攻击下的LQ自适应控制(扩展版本)下的LQ自适应控制范围
Regret Bounds for LQ Adaptive Control Under Database Attacks (Extended Version)
论文作者
论文摘要
本文涉及理解和反对数据库攻击对基于学习的线性二次自适应控制器的影响。该攻击既不针对传感器也不针对执行器,而只是毒物,而毒物是监管方案的一部分学习算法和参数估计器。我们专注于Abbasi-Yadkori和Szepesvari引入的自适应最佳控制算法,并在存在攻击的情况下提供遗憾分析以及减轻其效果的修改。该算法的一个核心步骤是自我调节的在线最小二乘估计,该估计决定了围绕系统的真实参数的紧密置信度,其概率很高。在没有恶意数据注入的情况下,该集合为控制设计的目的提供了适当的参数估算值。但是,在存在攻击的情况下,这种信心集已不再可靠。因此,我们首先解决了如何调整置信设置的问题,以便可以补偿有毒数据的效果。然后,我们通过在攻击中限制了闭环系统的遗憾,量化了这种攻击对控制策略的最优性的有害影响。
This paper is concerned with understanding and countering the effects of database attacks on a learning-based linear quadratic adaptive controller. This attack targets neither sensors nor actuators, but just poisons the learning algorithm and parameter estimator that is part of the regulation scheme. We focus on the adaptive optimal control algorithm introduced by Abbasi-Yadkori and Szepesvari and provide regret analysis in the presence of attacks as well as modifications that mitigate their effects. A core step of this algorithm is the self-regularized on-line least squares estimation, which determines a tight confidence set around the true parameters of the system with high probability. In the absence of malicious data injection, this set provides an appropriate estimate of parameters for the aim of control design. However, in the presence of attack, this confidence set is not reliable anymore. Hence, we first tackle the question of how to adjust the confidence set so that it can compensate for the effect of the poisonous data. Then, we quantify the deleterious effect of this type of attack on the optimality of control policy by bounding regret of the closed-loop system under attack.