论文标题
引理:用于学习多代理多任务活动的多视图数据集
LEMMA: A Multi-view Dataset for Learning Multi-agent Multi-task Activities
论文作者
论文摘要
理解和解释人类行为是长期以来的挑战,也是人工智能感知的关键指标。但是,在先前的文献中,每日人类活动的一些当务之急,包括目标指导的行动,并发的多任务和多项式合作。我们介绍了引理数据集,以提供单个房屋,以精心设计的设置来解决这些缺失的维度,其中任务和代理的数量各不相同,以突出不同的学习目标。我们密集地注释了与人类相互作用的原子活化,以提供日常活动的组成性,调度和分配的基础真相。我们进一步设计了具有挑战性的组成行动识别和行动/任务预期基准,具有基线模型,以衡量组成行动理解和时间推理的能力。我们希望这项努力将推动机器愿景社区检查目标指导的人类活动,并进一步研究现实世界中的任务计划和分配。
Understanding and interpreting human actions is a long-standing challenge and a critical indicator of perception in artificial intelligence. However, a few imperative components of daily human activities are largely missed in prior literature, including the goal-directed actions, concurrent multi-tasks, and collaborations among multi-agents. We introduce the LEMMA dataset to provide a single home to address these missing dimensions with meticulously designed settings, wherein the number of tasks and agents varies to highlight different learning objectives. We densely annotate the atomic-actions with human-object interactions to provide ground-truths of the compositionality, scheduling, and assignment of daily activities. We further devise challenging compositional action recognition and action/task anticipation benchmarks with baseline models to measure the capability of compositional action understanding and temporal reasoning. We hope this effort would drive the machine vision community to examine goal-directed human activities and further study the task scheduling and assignment in the real world.