学习任务不足的动作空间用于运动优化

论文标题

学习任务不足的动作空间用于运动优化

Learning Task-Agnostic Action Spaces for Movement Optimization

论文作者

Babadi, Amin, van de Panne, Michiel, Liu, C. Karen, Hämäläinen, Perttu

论文摘要

我们提出了一种新颖的方法，用于探索基于物理的动画角色的动力学，并学习一个任务无关的动作空间，使运动优化更容易。像以前的几篇论文一样，我们将动作作为目标状态进行参数化，并学习一个胜过目标的低级控制策略，从而将代理商的状态驱动到目标。我们的新颖贡献是，借助我们的勘探数据，我们能够以通用的方式学习低级政策，而无需任何参考移动数据。该策略曾为每个代理商或模拟环境进行一次培训，提高了在多个任务和优化算法中优化轨迹和高级政策的效率。我们还贡献了新颖的可视化，这些可视化表明如何将目标状态用作动作使优化的轨迹对干扰更强大。这表现为易于找到的更广泛的优点。由于其简单性和一般性，我们提出的方法应提供一个可以改善各种运动优化方法和应用的组成部分。

We propose a novel method for exploring the dynamics of physically based animated characters, and learning a task-agnostic action space that makes movement optimization easier. Like several previous papers, we parameterize actions as target states, and learn a short-horizon goal-conditioned low-level control policy that drives the agent's state towards the targets. Our novel contribution is that with our exploration data, we are able to learn the low-level policy in a generic manner and without any reference movement data. Trained once for each agent or simulation environment, the policy improves the efficiency of optimizing both trajectories and high-level policies across multiple tasks and optimization algorithms. We also contribute novel visualizations that show how using target states as actions makes optimized trajectories more robust to disturbances; this manifests as wider optima that are easy to find. Due to its simplicity and generality, our proposed approach should provide a building block that can improve a large variety of movement optimization methods and applications.

下载PDF全文

下载文献需遵守相关版权规定

论文标题