基于图像的3D对象检测的伪标记的实证研究

论文标题

基于图像的3D对象检测的伪标记的实证研究

An Empirical Study of Pseudo-Labeling for Image-based 3D Object Detection

论文作者

Ma, Xinzhu, Meng, Yuan, Zhang, Yinmin, Bai, Lei, Hou, Jun, Yi, Shuai, Ouyang, Wanli

论文摘要

基于图像的3D检测是自动驾驶感知系统的必不可少的组成部分。但是，它仍然受到不满意的表现，这是有限的培训数据的主要原因之一。不幸的是，对3D空间中的对象注释非常耗时/资源，这使得很难任意扩展训练集。在这项工作中，我们专注于半监督的方式，并探索更便宜的替代方案（即伪标记）的可行性，以利用未标记的数据。为此，我们进行了广泛的实验，以研究伪标签是否可以在不同环境下为基线模型提供有效的监督。实验结果不仅证明了基于图像的3D检测的伪标记机制的有效性（例如，在单眼环境下，我们在Kitti-3D测试中实现了20.23 AP，而无需铃声和哨声设置的中等水平，但在6.03 AP中将基线模型提高了几个有趣的模型），并且在培训中培训了多个模型（E.G。基于相同培训数据的地面真相注释）。我们希望这项工作可以为基于图像的3D检测界提供见解，在半监督的环境下。代码，伪标签和预培训模型将公开可用。

Image-based 3D detection is an indispensable component of the perception system for autonomous driving. However, it still suffers from the unsatisfying performance, one of the main reasons for which is the limited training data. Unfortunately, annotating the objects in the 3D space is extremely time/resource-consuming, which makes it hard to extend the training set arbitrarily. In this work, we focus on the semi-supervised manner and explore the feasibility of a cheaper alternative, i.e. pseudo-labeling, to leverage the unlabeled data. For this purpose, we conduct extensive experiments to investigate whether the pseudo-labels can provide effective supervision for the baseline models under varying settings. The experimental results not only demonstrate the effectiveness of the pseudo-labeling mechanism for image-based 3D detection (e.g. under monocular setting, we achieve 20.23 AP for moderate level on the KITTI-3D testing set without bells and whistles, improving the baseline model by 6.03 AP), but also show several interesting and noteworthy findings (e.g. the models trained with pseudo-labels perform better than that trained with ground-truth annotations based on the same training data). We hope this work can provide insights for the image-based 3D detection community under a semi-supervised setting. The codes, pseudo-labels, and pre-trained models will be publicly available.

下载PDF全文

下载文献需遵守相关版权规定

论文标题