论文标题
多物质腿部机器人控制作为序列建模问题
Multi-embodiment Legged Robot Control as a Sequence Modeling Problem
论文作者
论文摘要
传统上,机器人在其运营寿命中受到固定体现的限制,这限制了它们适应周围环境的能力。然而,由于控制器和形态之间的复杂相互作用,机器人的合作控制和形态通常效率低下。在本文中,我们提出了一种基于学习的控制方法,该方法可以固有地考虑形态,以便一旦对控制策略进行了模拟器训练,就可以轻松地将其部署到现实世界中具有不同实施方案的机器人中。特别是,我们介绍了实施方案感知变压器(EAT),该结构将此控制问题作为条件序列建模。 EAT通过利用因果掩盖的变压器来输出最佳动作。通过在所需的机器人实施例,过去状态和动作上调节自回旋模型,我们的EAT模型可以产生最适合当前机器人实施例的动作。实验结果表明,EAT可以在实施方式变化的任务中胜过所有其他替代方案,并在现实世界中进化任务的示例中取得成功:通过单独更新形态来降低楼梯。我们希望EAT会激发跨许多领域的现实发展的新推动,在这些领域中,诸如EAT之类的算法可以通过桥接进化机器人技术和大数据序列建模的领域来燃烧一条小径。
Robots are traditionally bounded by a fixed embodiment during their operational lifetime, which limits their ability to adapt to their surroundings. Co-optimizing control and morphology of a robot, however, is often inefficient due to the complex interplay between the controller and morphology. In this paper, we propose a learning-based control method that can inherently take morphology into consideration such that once the control policy is trained in the simulator, it can be easily deployed to robots with different embodiments in the real world. In particular, we present the Embodiment-aware Transformer (EAT), an architecture that casts this control problem as conditional sequence modeling. EAT outputs the optimal actions by leveraging a causally masked Transformer. By conditioning an autoregressive model on the desired robot embodiment, past states, and actions, our EAT model can generate future actions that best fit the current robot embodiment. Experimental results show that EAT can outperform all other alternatives in embodiment-varying tasks, and succeed in an example of real-world evolution tasks: stepping down a stair through updating the morphology alone. We hope that EAT will inspire a new push toward real-world evolution across many domains, where algorithms like EAT can blaze a trail by bridging the field of evolutionary robotics and big data sequence modeling.