基于传感器机器人控制的基本限制

论文标题

基于传感器机器人控制的基本限制

Fundamental Limits for Sensor-Based Robot Control

论文作者

Majumdar, Anirudha, Mei, Zhiting, Pacelli, Vincent

论文摘要

我们的目标是开发理论和算法，以建立对机器人传感器对给定任务施加的性能的基本限制。为了实现这一目标，我们定义了一个捕获传感器提供的与任务相关的信息量的数量。使用信息理论的新颖版本的通用Fano不平等现象，我们证明了该数量为一步决策任务提供了最高可实现的预期奖励的上限。然后，我们通过动态编程方法扩展到多步问题。 We present algorithms for numerically computing the resulting bounds, and demonstrate our approach on three examples: (i) the lava problem from the literature on partially observable Markov decision processes, (ii) an example with continuous state and observation spaces corresponding to a robot catching a freely-falling object, and (iii) obstacle avoidance using a depth sensor with non-Gaussian noise.我们通过将我们的上限与可实现的下限进行比较（通过合成或学习具体控制策略计算），证明了我们的方法对这些问题的可实现绩效的强大限制的能力。

Our goal is to develop theory and algorithms for establishing fundamental limits on performance imposed by a robot's sensors for a given task. In order to achieve this, we define a quantity that captures the amount of task-relevant information provided by a sensor. Using a novel version of the generalized Fano inequality from information theory, we demonstrate that this quantity provides an upper bound on the highest achievable expected reward for one-step decision making tasks. We then extend this bound to multi-step problems via a dynamic programming approach. We present algorithms for numerically computing the resulting bounds, and demonstrate our approach on three examples: (i) the lava problem from the literature on partially observable Markov decision processes, (ii) an example with continuous state and observation spaces corresponding to a robot catching a freely-falling object, and (iii) obstacle avoidance using a depth sensor with non-Gaussian noise. We demonstrate the ability of our approach to establish strong limits on achievable performance for these problems by comparing our upper bounds with achievable lower bounds (computed by synthesizing or learning concrete control policies).

下载PDF全文

下载文献需遵守相关版权规定

论文标题