ralacs：使用互动编码和光流中自动驾驶汽车的动作识别

论文标题

ralacs：使用互动编码和光流中自动驾驶汽车的动作识别

RALACs: Action Recognition in Autonomous Vehicles using Interaction Encoding and Optical Flow

论文作者

Zhou, Eddy, Zhuang, Alex, Budhwani, Alikasim, Leather, Owen, Dempster, Rowan, Li, Quanquan, Al-Sharman, Mohammad, Rayside, Derek, Melek, William

论文摘要

当应用于自动驾驶汽车（AV）设置时，行动识别可以增强环境模型的情境意识。在传统的几何描述和AV中的启发式方法不足的情况下，这尤其普遍。但是，传统上已经对人类进行了行动识别，并且其对嘈杂，未击倒，未降低，原始RGB数据的适应性有限，限制了其在其他领域的应用。为了推动进步和采用行动识别为AVS，这项工作提出了一种新型的两阶段动作识别系统，称为Ralacs。拉拉克斯（Ralacs）提出了对道路场景的行动识别问题，并弥合了它与既定的人类行动识别领域之间的差距。这项工作表明，注意层如何对跨代理的关系有用，并强调这种方案如何具有类不可能。此外，为了解决道路上代理的动态性质，Ralacs构建了一种新颖的方法，以适应感兴趣区域（ROI）对齐对代理轨道进行下游动作分类。最后，我们的计划还考虑了主动剂检测的问题，并利用了融合光流图的新颖应用来辨别道路场景中相关的代理。我们表明，我们提出的方案可以胜过ICCV2021 Road Challenge数据集上的基线，并通过将其部署在真实的车辆平台上，我们提供了对决策中行动识别的有用性的初步见解。

When applied to autonomous vehicle (AV) settings, action recognition can enhance an environment model's situational awareness. This is especially prevalent in scenarios where traditional geometric descriptions and heuristics in AVs are insufficient. However, action recognition has traditionally been studied for humans, and its limited adaptability to noisy, un-clipped, un-pampered, raw RGB data has limited its application in other fields. To push for the advancement and adoption of action recognition into AVs, this work proposes a novel two-stage action recognition system, termed RALACs. RALACs formulates the problem of action recognition for road scenes, and bridges the gap between it and the established field of human action recognition. This work shows how attention layers can be useful for encoding the relations across agents, and stresses how such a scheme can be class-agnostic. Furthermore, to address the dynamic nature of agents on the road, RALACs constructs a novel approach to adapting Region of Interest (ROI) Alignment to agent tracks for downstream action classification. Finally, our scheme also considers the problem of active agent detection, and utilizes a novel application of fusing optical flow maps to discern relevant agents in a road scene. We show that our proposed scheme can outperform the baseline on the ICCV2021 Road Challenge dataset and by deploying it on a real vehicle platform, we provide preliminary insight to the usefulness of action recognition in decision making.

下载PDF全文

下载文献需遵守相关版权规定

论文标题