在加强学习中进行技能转移的层次结构开始

论文标题

在加强学习中进行技能转移的层次结构开始

Hierarchical Kickstarting for Skill Transfer in Reinforcement Learning

论文作者

Matthews, Michael, Samvelyan, Mikayel, Parker-Holder, Jack, Grefenstette, Edward, Rocktäschel, Tim

论文摘要

练习和磨练技能构成了人类学习方式的基本组成部分，但是人工代理很少专门培训以执行它们。取而代之的是，它们通常是端到端训练的，希望将有用的技能隐含地学习，以最大程度地提高某些外部奖励功能的折扣回报。在本文中，我们研究了如何将技能纳入具有较大州行动空间和稀疏奖励的复杂环境中的增强学习（RL）训练中。为此，我们创建了SkillHack，这是Nethack游戏的任务和相关技能的基准。我们在此基准上评估了许多基准，以及我们自己的新型基于技能的方法分层启动（HKS），该方法表现出优于所有其他评估的方法。我们的实验表明，先验了解有用技能的学习可以显着提高代理在复杂问题上的表现。我们最终认为，利用预定义的技能为RL问题提供了有用的归纳偏见，尤其是那些具有较大国家行动空间和稀疏奖励的问题。

Practising and honing skills forms a fundamental component of how humans learn, yet artificial agents are rarely specifically trained to perform them. Instead, they are usually trained end-to-end, with the hope being that useful skills will be implicitly learned in order to maximise discounted return of some extrinsic reward function. In this paper, we investigate how skills can be incorporated into the training of reinforcement learning (RL) agents in complex environments with large state-action spaces and sparse rewards. To this end, we created SkillHack, a benchmark of tasks and associated skills based on the game of NetHack. We evaluate a number of baselines on this benchmark, as well as our own novel skill-based method Hierarchical Kickstarting (HKS), which is shown to outperform all other evaluated methods. Our experiments show that learning with a prior knowledge of useful skills can significantly improve the performance of agents on complex problems. We ultimately argue that utilising predefined skills provides a useful inductive bias for RL problems, especially those with large state-action spaces and sparse rewards.

下载PDF全文

下载文献需遵守相关版权规定

论文标题