segvoxelnet：探索从点云检测3D车辆检测的语义上下文和深度感知功能

论文标题

segvoxelnet：探索从点云检测3D车辆检测的语义上下文和深度感知功能

SegVoxelNet: Exploring Semantic Context and Depth-aware Features for 3D Vehicle Detection from Point Cloud

论文作者

Yi, Hongwei, Shi, Shaoshuai, Ding, Mingyu, Sun, Jiankai, Xu, Kui, Zhou, Hui, Wang, Zhe, Li, Sheng, Wang, Guoping

论文摘要

基于点云的3D车辆检测是实际应用（例如自动驾驶）的一项艰巨任务。尽管取得了重大进展，但我们观察到两个方面要进一步改善。首先，在以前的工作中很少探索LiDAR中的语义上下文信息，这可能有助于识别模棱两可的车辆。其次，车辆上的点云的分布会随着深度的增加而连续变化，这可能无法通过单个模型来很好地建模。在这项工作中，我们提出了一个统一的模型Segvoxelnet，以解决上述两个问题。提出了一个语义上下文编码器，以利用鸟类视图中的免费语义分割面罩。可疑区域可以突出显示该模块被压抑嘈杂的区域。为了更好地处理不同深度的车辆，新颖的深度感知头旨在明确地对分布差异进行建模，并且深度感知头的每个部分都旨在集中在其自身的目标检测范围上。 KITTI数据集上的广泛实验表明，所提出的方法仅以点云为输入，在准确性和效率方面优于最先进的替代方案。

3D vehicle detection based on point cloud is a challenging task in real-world applications such as autonomous driving. Despite significant progress has been made, we observe two aspects to be further improved. First, the semantic context information in LiDAR is seldom explored in previous works, which may help identify ambiguous vehicles. Second, the distribution of point cloud on vehicles varies continuously with increasing depths, which may not be well modeled by a single model. In this work, we propose a unified model SegVoxelNet to address the above two problems. A semantic context encoder is proposed to leverage the free-of-charge semantic segmentation masks in the bird's eye view. Suspicious regions could be highlighted while noisy regions are suppressed by this module. To better deal with vehicles at different depths, a novel depth-aware head is designed to explicitly model the distribution differences and each part of the depth-aware head is made to focus on its own target detection range. Extensive experiments on the KITTI dataset show that the proposed method outperforms the state-of-the-art alternatives in both accuracy and efficiency with point cloud as input only.

下载PDF全文

下载文献需遵守相关版权规定

论文标题