论文标题
更深入地识别黑暗环境中的行动:一项全面的基准研究
Going Deeper into Recognizing Actions in Dark Environments: A Comprehensive Benchmark Study
论文作者
论文摘要
尽管通过引入大型视频数据集并开发了深层神经网络,但动作识别(AR)已取得了很大的改进,但在实际情况下,AR模型对挑战性环境的强大模型仍未得到探索。我们专注于黑暗环境中的行动识别任务,该任务可以应用于夜间监视和自动驾驶等领域。直观地,当前的深网以及视觉增强技术应该能够在黑暗环境中处理AR,但是,在实践中,情况并非总是如此。为了深入研究黑暗环境中AR的解决方案,我们在IEEE CVPR 2021中启动了UG2+挑战赛2(UG2-2),目的是评估和推进黑暗环境中AR模型的稳健性。挑战是在一个新颖的干旱数据集,第一个用于Dark Video AR任务的数据集上建立和扩展,并指导模型以完全和半监督的方式处理此类任务。报告了利用当前AR模型和增强方法的基线结果,证明了这项任务的挑战性质,并具有很大的改进空间。得益于研究界的积极参与,参与者的解决方案已经取得了显着进步,而对这些解决方案的分析有助于更好地确定在黑暗环境中应对AR挑战的可能方向。
While action recognition (AR) has gained large improvements with the introduction of large-scale video datasets and the development of deep neural networks, AR models robust to challenging environments in real-world scenarios are still under-explored. We focus on the task of action recognition in dark environments, which can be applied to fields such as surveillance and autonomous driving at night. Intuitively, current deep networks along with visual enhancement techniques should be able to handle AR in dark environments, however, it is observed that this is not always the case in practice. To dive deeper into exploring solutions for AR in dark environments, we launched the UG2+ Challenge Track 2 (UG2-2) in IEEE CVPR 2021, with a goal of evaluating and advancing the robustness of AR models in dark environments. The challenge builds and expands on top of a novel ARID dataset, the first dataset for the task of dark video AR, and guides models to tackle such a task in both fully and semi-supervised manners. Baseline results utilizing current AR models and enhancement methods are reported, justifying the challenging nature of this task with substantial room for improvements. Thanks to the active participation from the research community, notable advances have been made in participants' solutions, while analysis of these solutions helped better identify possible directions to tackle the challenge of AR in dark environments.