RET3D：重新思考对象关系以在驾驶场景中进行有效的3D对象检测

论文标题

RET3D：重新思考对象关系以在驾驶场景中进行有效的3D对象检测

Ret3D: Rethinking Object Relations for Efficient 3D Object Detection in Driving Scenes

论文作者

Wu, Yu-Huan, Zhang, Da, Zhang, Le, Zhan, Xin, Dai, Dengxin, Liu, Yun, Cheng, Ming-Ming

论文摘要

目前缺乏基于LIDAR的检测框架在利用对象关系时自然而然地以空间和时间的方式存在。为此，我们引入了一个简单，高效且有效的两阶段检测器，称为RET3D。 RET3D的核心是利用新型的框架内和框架间关系模块，以相应地捕获空间和时间关系。更具体地说，框内关系模块（Intrarm）将框架内对象封装到稀疏图中，从而使我们能够通过有效的消息传递来完善对象特征。另一方面，框架间关系模块（Interm）密集地将每个对象动态连接到相应的跟踪序列中，并利用此类时间信息以通过轻量级变压器网络有效地增强其表示形式。我们通过基于中心的或基于锚的探测器将新颖的Intrarm和Interm设计实例化，并在Waymo Open DataSet（WOD）上对其进行评估。由于额外的额外费用可忽略不计，RET3D实现了最先进的性能，就1级和2级MAPH指标而言，在车辆检测方面分别比最近的竞争对手高出5.5％和3.2％。

Current efficient LiDAR-based detection frameworks are lacking in exploiting object relations, which naturally present in both spatial and temporal manners. To this end, we introduce a simple, efficient, and effective two-stage detector, termed as Ret3D. At the core of Ret3D is the utilization of novel intra-frame and inter-frame relation modules to capture the spatial and temporal relations accordingly. More Specifically, intra-frame relation module (IntraRM) encapsulates the intra-frame objects into a sparse graph and thus allows us to refine the object features through efficient message passing. On the other hand, inter-frame relation module (InterRM) densely connects each object in its corresponding tracked sequences dynamically, and leverages such temporal information to further enhance its representations efficiently through a lightweight transformer network. We instantiate our novel designs of IntraRM and InterRM with general center-based or anchor-based detectors and evaluate them on Waymo Open Dataset (WOD). With negligible extra overhead, Ret3D achieves the state-of-the-art performance, being 5.5% and 3.2% higher than the recent competitor in terms of the LEVEL 1 and LEVEL 2 mAPH metrics on vehicle detection, respectively.

下载PDF全文

下载文献需遵守相关版权规定

论文标题