论文标题
Sture:在线多对象跟踪中鲁棒数据关联的时空相互表示学习
STURE: Spatial-Temporal Mutual Representation Learning for Robust Data Association in Online Multi-Object Tracking
论文作者
论文摘要
在线多对象跟踪(MOT)是计算机视觉和智能车辆平台的长期任务。目前,主要的范式是逐项跟踪,该范式的主要困难是如何将当前候选探测与历史踪迹联系起来。但是,在MOT场景中,每个历史曲目都由对象序列组成,而每个候选检测只是一个平坦的图像,它缺乏对象序列的时间特征。当前候选探测和历史踪迹之间的特征差异使对象关联变得更加困难。因此,我们提出了一种时空相互表示学习(Sture)方法,该方法在相互表示空间中学习了当前候选检测与历史序列之间的时空表示。对于历史跟踪器,检测学习网络被迫匹配相互表示空间中序列学习网络的表示。所提出的方法能够通过使用对象关联中的各种设计损失来提取更多区分检测和序列表示。结果,空间时期特征是相互学习的,以增强当前的检测特征,并且可以缓解特征差异。为了证明坚固的鲁棒性,它适用于公共MOT挑战基准,并且与基于身份保护指标的各种最新的在线MOT跟踪器相比,它的性能很好。
Online multi-object tracking (MOT) is a longstanding task for computer vision and intelligent vehicle platform. At present, the main paradigm is tracking-by-detection, and the main difficulty of this paradigm is how to associate current candidate detections with historical tracklets. However, in the MOT scenarios, each historical tracklet is composed of an object sequence, while each candidate detection is just a flat image, which lacks temporal features of the object sequence. The feature difference between current candidate detections and historical tracklets makes the object association much harder. Therefore, we propose a Spatial-Temporal Mutual Representation Learning (STURE) approach which learns spatial-temporal representations between current candidate detections and historical sequences in a mutual representation space. For historical trackelets, the detection learning network is forced to match the representations of sequence learning network in a mutual representation space. The proposed approach is capable of extracting more distinguishing detection and sequence representations by using various designed losses in object association. As a result, spatial-temporal feature is learned mutually to reinforce the current detection features, and the feature difference can be relieved. To prove the robustness of the STURE, it is applied to the public MOT challenge benchmarks and performs well compared with various state-of-the-art online MOT trackers based on identity-preserving metrics.