论文标题
IA-MOT:具有运动一致性的实例感知多目标跟踪
IA-MOT: Instance-Aware Multi-Object Tracking with Motion Consistency
论文作者
论文摘要
多对象跟踪(MOT)是计算机视觉社会中的至关重要任务。但是,大多数具有可用检测到的边界框的逐探MoT方法无法有效地处理静态,缓慢和快速移动的相机场景,因为自我动作和频繁的闭塞。在这项工作中,我们提出了一个新颖的跟踪框架,称为“实例吸引MOT”(ia-mot),可以通过共同考虑实例级级特征和对象运动来跟踪静态或移动摄像机中的多个对象。首先,通过将给定的检测作为区域建议发送,从带有额外嵌入头的蒙版R-CNN检测器的变体中提取出色的外观特征。同时,从给定的实例掩码生成,并应用于提取的嵌入功能,该空间注意力集中在边界框中的前景上。在跟踪阶段,使用匈牙利协会算法通过特征相似性和运动一致性来对齐对象实例掩模。此外,对象重新识别(REID)被合并,以恢复由长期阻塞或缺失检测引起的ID开关。总体而言,当对MOTS20和KITTI-MOTS数据集进行评估时,我们提出的方法在CVPR2020讲习班中赢得了BMTT挑战赛3的第一名。
Multiple object tracking (MOT) is a crucial task in computer vision society. However, most tracking-by-detection MOT methods, with available detected bounding boxes, cannot effectively handle static, slow-moving and fast-moving camera scenarios simultaneously due to ego-motion and frequent occlusion. In this work, we propose a novel tracking framework, called "instance-aware MOT" (IA-MOT), that can track multiple objects in either static or moving cameras by jointly considering the instance-level features and object motions. First, robust appearance features are extracted from a variant of Mask R-CNN detector with an additional embedding head, by sending the given detections as the region proposals. Meanwhile, the spatial attention, which focuses on the foreground within the bounding boxes, is generated from the given instance masks and applied to the extracted embedding features. In the tracking stage, object instance masks are aligned by feature similarity and motion consistency using the Hungarian association algorithm. Moreover, object re-identification (ReID) is incorporated to recover ID switches caused by long-term occlusion or missing detection. Overall, when evaluated on the MOTS20 and KITTI-MOTS dataset, our proposed method won the first place in Track 3 of the BMTT Challenge in CVPR2020 workshops.