学习运动技能的混合控制

论文标题

学习运动技能的混合控制

Hybrid Control for Learning Motor Skills

论文作者

Abraham, Ian, Broad, Alexander, Pinosky, Allison, Argall, Brenna, Murphey, Todd D.

论文摘要

我们为机器人学习开发了一种混合控制方法，基于将学习的预测模型与基于经验的国家行动策略映射相结合，以提高机器人系统的学习能力。预测模型提供了对任务和物理学的理解（可以提高样本效率），而基于经验的政策映射被视为“肌肉记忆”，将有利的动作编码为覆盖计划行动的经验。混合控制工具用于创建一种将学习的预测模型与基于经验的学习相结合的算法方法。混合学习是通过系统地结合和改善预测模型和基于经验的政策的性能来有效地学习运动技能的一种方法。杂种学习的确定性变化被得出并扩展为随机实现，从而放松了原始派生中的某些关键假设。每种变化均可在基于经验的学习方法（机器人与环境相互作用以获得经验）以及模仿学习方法（通过演示提供经验并在环境中进行测试）进行测试。结果表明，我们的方法能够提高各种实验领域学习运动技能的性能和样本效率。

We develop a hybrid control approach for robot learning based on combining learned predictive models with experience-based state-action policy mappings to improve the learning capabilities of robotic systems. Predictive models provide an understanding of the task and the physics (which improves sample-efficiency), while experience-based policy mappings are treated as "muscle memory" that encode favorable actions as experiences that override planned actions. Hybrid control tools are used to create an algorithmic approach for combining learned predictive models with experience-based learning. Hybrid learning is presented as a method for efficiently learning motor skills by systematically combining and improving the performance of predictive models and experience-based policies. A deterministic variation of hybrid learning is derived and extended into a stochastic implementation that relaxes some of the key assumptions in the original derivation. Each variation is tested on experience-based learning methods (where the robot interacts with the environment to gain experience) as well as imitation learning methods (where experience is provided through demonstrations and tested in the environment). The results show that our method is capable of improving the performance and sample-efficiency of learning motor skills in a variety of experimental domains.

下载PDF全文

下载文献需遵守相关版权规定

论文标题