Paper title
Co-training for On-board Deep Object Detection
Paper authors
Paper abstract
Providing ground truth supervision to train visual models has been a bottleneck over the years, exacerbated by domain shifts which degrade the performance of such models. This was the case when visual tasks relied on handcrafted features and shallow machine learning, and the problem remains open within the deep learning paradigm which, despite its unprecedented performance gains, is data-hungry by nature. The best-performing deep vision-based object detectors are trained in a supervised manner by relying on human-labeled bounding boxes which localize class instances (i.e., objects) within the training images. Thus, object detection is one such task for which human labeling is a major bottleneck. In this paper, we assess co-training as a semi-supervised learning method for self-labeling objects in unlabeled images, thereby reducing the human-labeling effort for developing deep object detectors. Our study pays special attention to a scenario involving domain shift; in particular, when we have automatically generated virtual-world images with object bounding boxes and we have real-world images which are unlabeled. Moreover, we are particularly interested in using co-training for deep object detection in the context of driver assistance systems and/or self-driving vehicles. Thus, using well-established datasets and protocols for object detection in these application contexts, we show that co-training is a paradigm worth pursuing for alleviating object labeling, working both alone and together with task-agnostic domain adaptation.