论文标题

用于机器人织物操纵策略的Sim2Real转移的学习切换标准

Learning Switching Criteria for Sim2Real Transfer of Robotic Fabric Manipulation Policies

论文作者

Sharma, Satvik, Novoseller, Ellen, Viswanath, Vainavi, Javed, Zaynah, Parikh, Rishi, Hoque, Ryan, Balakrishna, Ashwin, Brown, Daniel S., Goldberg, Ken

论文摘要

模拟到现实的转移已成为一种流行且非常成功的方法,用于培训各种任务的机器人控制策略。但是,确定在模拟中训练的政策何时准备将其转移到物理世界通常是一个挑战。部署经过很少的模拟数据训练的策略可能会导致物理硬件上的不可靠和危险的行为。另一方面,模拟中的过度训练会导致策略过度拟合模拟器的视觉外观和动力学。在这项工作中,我们研究了自动确定在模拟中训练的政策何时可以可靠地转移到物理机器人的策略。我们在机器人织物操纵的背景下特别研究了这些思想,因为成功建模织物的动态和视觉外观的困难,成功的SIM2Real转移尤其具有挑战性。导致织物平滑任务的结果表明,我们的切换标准与实际的性能很好地相关。特别是,我们基于置信的切换标准在培训总预算的55-60%之内达到了87.2-93.7%的平均最终面料覆盖率。有关代码和补充材料,请参见https://tinyurl.com/lsc-case。

Simulation-to-reality transfer has emerged as a popular and highly successful method to train robotic control policies for a wide variety of tasks. However, it is often challenging to determine when policies trained in simulation are ready to be transferred to the physical world. Deploying policies that have been trained with very little simulation data can result in unreliable and dangerous behaviors on physical hardware. On the other hand, excessive training in simulation can cause policies to overfit to the visual appearance and dynamics of the simulator. In this work, we study strategies to automatically determine when policies trained in simulation can be reliably transferred to a physical robot. We specifically study these ideas in the context of robotic fabric manipulation, in which successful sim2real transfer is especially challenging due to the difficulties of precisely modeling the dynamics and visual appearance of fabric. Results in a fabric smoothing task suggest that our switching criteria correlate well with performance in real. In particular, our confidence-based switching criteria achieve average final fabric coverage of 87.2-93.7% within 55-60% of the total training budget. See https://tinyurl.com/lsc-case for code and supplemental materials.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源