复杂互动行为的级联构图残差学习

论文标题

复杂互动行为的级联构图残差学习

Cascaded Compositional Residual Learning for Complex Interactive Behaviors

论文作者

Kumar, K. Niranjan, Essa, Irfan, Ha, Sehoon

论文摘要

现实世界中的自主任务通常需要与附近物体（例如门或开关）以及有效导航进行丰富的互动。但是，这种复杂的行为很难学习，因为它们涉及高级计划和低级运动控制。我们提出了一个新颖的框架，即级联的构图残差学习（CCRL），该框架通过递归利用先前学习的控制策略的库来学习复合技能。我们的框架可以同时学习乘法策略组成，特定于任务的残差操作和综合目标信息，同时冻结先决条件的政策。我们通过规范剩余动作来进一步明确控制运动的样式。我们表明，我们的框架学习了一套从基本的运动到复杂的互动导航，包括围绕障碍物导航，推动物体，爬在桌子下方，用腿打开门，并在穿过它时保持其打开状态。拟议的CCRL框架导致具有一致样式和较低关节扭矩的政策，我们成功地将其转移到了真正的Unitree A1机器人，而无需进行任何其他微调。

Real-world autonomous missions often require rich interaction with nearby objects, such as doors or switches, along with effective navigation. However, such complex behaviors are difficult to learn because they involve both high-level planning and low-level motor control. We present a novel framework, Cascaded Compositional Residual Learning (CCRL), which learns composite skills by recursively leveraging a library of previously learned control policies. Our framework learns multiplicative policy composition, task-specific residual actions, and synthetic goal information simultaneously while freezing the prerequisite policies. We further explicitly control the style of the motion by regularizing residual actions. We show that our framework learns joint-level control policies for a diverse set of motor skills ranging from basic locomotion to complex interactive navigation, including navigating around obstacles, pushing objects, crawling under a table, pushing a door open with its leg, and holding it open while walking through it. The proposed CCRL framework leads to policies with consistent styles and lower joint torques, which we successfully transfer to a real Unitree A1 robot without any additional fine-tuning.

下载PDF全文

下载文献需遵守相关版权规定

论文标题