引理：用于学习多代理多任务活动的多视图数据集

论文标题

引理：用于学习多代理多任务活动的多视图数据集

LEMMA: A Multi-view Dataset for Learning Multi-agent Multi-task Activities

论文作者

Jia, Baoxiong, Chen, Yixin, Huang, Siyuan, Zhu, Yixin, Zhu, Song-chun

论文摘要

理解和解释人类行为是长期以来的挑战，也是人工智能感知的关键指标。但是，在先前的文献中，每日人类活动的一些当务之急，包括目标指导的行动，并发的多任务和多项式合作。我们介绍了引理数据集，以提供单个房屋，以精心设计的设置来解决这些缺失的维度，其中任务和代理的数量各不相同，以突出不同的学习目标。我们密集地注释了与人类相互作用的原子活化，以提供日常活动的组成性，调度和分配的基础真相。我们进一步设计了具有挑战性的组成行动识别和行动/任务预期基准，具有基线模型，以衡量组成行动理解和时间推理的能力。我们希望这项努力将推动机器愿景社区检查目标指导的人类活动，并进一步研究现实世界中的任务计划和分配。

Understanding and interpreting human actions is a long-standing challenge and a critical indicator of perception in artificial intelligence. However, a few imperative components of daily human activities are largely missed in prior literature, including the goal-directed actions, concurrent multi-tasks, and collaborations among multi-agents. We introduce the LEMMA dataset to provide a single home to address these missing dimensions with meticulously designed settings, wherein the number of tasks and agents varies to highlight different learning objectives. We densely annotate the atomic-actions with human-object interactions to provide ground-truths of the compositionality, scheduling, and assignment of daily activities. We further devise challenging compositional action recognition and action/task anticipation benchmarks with baseline models to measure the capability of compositional action understanding and temporal reasoning. We hope this effort would drive the machine vision community to examine goal-directed human activities and further study the task scheduling and assignment in the real world.

下载PDF全文

下载文献需遵守相关版权规定

论文标题