论文标题
现实世界中的人机协作加强学习
Real-World Human-Robot Collaborative Reinforcement Learning
论文作者
论文摘要
人类和智能机器人(体现的AI)在现实世界中的直观合作是许多理想应用机器人技术的基本目标。尽管关于显式沟通的研究有很多研究,但我们将重点放在人类和机器人如何隐式相互作用上,在运动适应水平上。我们通过将操作限制在两个正交轴的旋转中,并将每个轴分配给一个玩家的旋转,介绍了人类机器人协作迷宫游戏的真实设置,该游戏旨在通过协作来解决。这既无法使人类也无法独自解决游戏。我们使用深度加强学习来控制机器人剂,并在现实游戏的30分钟内取得结果,而没有任何类型的预训练。然后,我们使用此设置来执行有关人/代理行为的系统实验,并在共同学习协作游戏的策略时进行适应。我们介绍了有关人类与机器人代理之间如何随着时间的推移进行的结果,从而导致每个参与者的代理商来表现他们如何玩游戏。通过将代理商的政策与自己的代理商的政策进行比较,这使我们可以在与其他代理人的策略与自己的代理人玩耍时的成功联系起来。
The intuitive collaboration of humans and intelligent robots (embodied AI) in the real-world is an essential objective for many desirable applications of robotics. Whilst there is much research regarding explicit communication, we focus on how humans and robots interact implicitly, on motor adaptation level. We present a real-world setup of a human-robot collaborative maze game, designed to be non-trivial and only solvable through collaboration, by limiting the actions to rotations of two orthogonal axes, and assigning each axes to one player. This results in neither the human nor the agent being able to solve the game on their own. We use deep reinforcement learning for the control of the robotic agent, and achieve results within 30 minutes of real-world play, without any type of pre-training. We then use this setup to perform systematic experiments on human/agent behaviour and adaptation when co-learning a policy for the collaborative game. We present results on how co-policy learning occurs over time between the human and the robotic agent resulting in each participant's agent serving as a representation of how they would play the game. This allows us to relate a person's success when playing with different agents than their own, by comparing the policy of the agent with that of their own agent.