论文标题
通过模仿动物学习敏捷的机器人运动技能
Learning Agile Robotic Locomotion Skills by Imitating Animals
论文作者
论文摘要
在机器人技术中,重现动物的多样化和敏捷的运动能力一直是一个长期的挑战。尽管手动设计的控制器能够模仿许多复杂的行为,但构建此类控制器涉及耗时且困难的开发过程,通常需要对每种技能的细微差别进行实质性的专业知识。强化学习为自动化控制器开发所涉及的手动努力提供了一种吸引人的替代方法。但是,设计出从代理商那里引起所需行为的学习目标也可能需要大量的技能专业知识。在这项工作中,我们提出了一个模仿学习系统,该系统使腿部机器人能够通过模仿现实世界动物来学习敏捷的运动技能。我们表明,通过利用参考运动数据,一种基于学习的方法能够自动为腿部机器人的多种曲目行为合成控制器。通过将样品有效的域自适应技术纳入培训过程,我们的系统能够在模拟中学习自适应策略,然后可以快速适应现实世界的部署。为了证明系统的有效性,我们训练一个18道四倍的机器人,以执行从不同的运动步态到动态啤酒花和转弯的各种敏捷行为。
Reproducing the diverse and agile locomotion skills of animals has been a longstanding challenge in robotics. While manually-designed controllers have been able to emulate many complex behaviors, building such controllers involves a time-consuming and difficult development process, often requiring substantial expertise of the nuances of each skill. Reinforcement learning provides an appealing alternative for automating the manual effort involved in the development of controllers. However, designing learning objectives that elicit the desired behaviors from an agent can also require a great deal of skill-specific expertise. In this work, we present an imitation learning system that enables legged robots to learn agile locomotion skills by imitating real-world animals. We show that by leveraging reference motion data, a single learning-based approach is able to automatically synthesize controllers for a diverse repertoire behaviors for legged robots. By incorporating sample efficient domain adaptation techniques into the training process, our system is able to learn adaptive policies in simulation that can then be quickly adapted for real-world deployment. To demonstrate the effectiveness of our system, we train an 18-DoF quadruped robot to perform a variety of agile behaviors ranging from different locomotion gaits to dynamic hops and turns.