加速机器人学习的接触式操纵：课程学习研究

论文标题

加速机器人学习的接触式操纵：课程学习研究

Accelerating Robot Learning of Contact-Rich Manipulations: A Curriculum Learning Study

论文作者

Beltran-Hernandez, Cristian C., Petit, Damien, Ramirez-Alpizar, Ixchel G., Harada, Kensuke

论文摘要

加固学习（RL）范式一直是自动化机器人任务的重要工具。尽管RL取得了进步，但由于需要与环境进行昂贵的大量机器人互动，因此该行业仍未广泛采用。已经提出了课程学习（CL）来加快学习。但是，大多数研究工作仅在模拟环境中进行评估，从视频游戏到机器人玩具任务。本文介绍了一项研究，用于加速基于课程学习与域随机分组（DR）的机器人学习富含接触的操纵任务的研究。我们使用位置控制的机器人（例如插入任务）来处理复杂的工业集会任务。我们比较了DR的不同课程设计和采样方法。基于这项研究，我们提出了一种明显优于先前的工作的方法，该方法仅使用DR（不使用CL），而训练时间的五分之一（样品）。结果还表明，即使仅在模拟玩具任务进行培训时，我们的方法也可以学习可以转移到现实世界机器人的政策。在培训期间，博学的政策在现实世界中复杂的工业插入任务（公差为$ \ pm 0.01〜mm $）上达到了多达86％的成功率。

The Reinforcement Learning (RL) paradigm has been an essential tool for automating robotic tasks. Despite the advances in RL, it is still not widely adopted in the industry due to the need for an expensive large amount of robot interaction with its environment. Curriculum Learning (CL) has been proposed to expedite learning. However, most research works have been only evaluated in simulated environments, from video games to robotic toy tasks. This paper presents a study for accelerating robot learning of contact-rich manipulation tasks based on Curriculum Learning combined with Domain Randomization (DR). We tackle complex industrial assembly tasks with position-controlled robots, such as insertion tasks. We compare different curricula designs and sampling approaches for DR. Based on this study, we propose a method that significantly outperforms previous work, which uses DR only (No CL is used), with less than a fifth of the training time (samples). Results also show that even when training only in simulation with toy tasks, our method can learn policies that can be transferred to the real-world robot. The learned policies achieved success rates of up to 86\% on real-world complex industrial insertion tasks (with tolerances of $\pm 0.01~mm$) not seen during the training.

下载PDF全文

下载文献需遵守相关版权规定

论文标题