PointNext：通过改进的培训和缩放策略重新访问PointNet ++

论文标题

PointNext：通过改进的培训和缩放策略重新访问PointNet ++

PointNeXt: Revisiting PointNet++ with Improved Training and Scaling Strategies

论文作者

Qian, Guocheng, Li, Yuchen, Peng, Houwen, Mai, Jinjie, Hammoud, Hasan Abed Al Kader, Elhoseiny, Mohamed, Ghanem, Bernard

论文摘要

PointNet ++是Point Cloud理解的最具影响力的神经体系结构之一。尽管PointNet ++的准确性在很大程度上已被诸如PointMLP和Point Transformer等最近的网络超过，但我们发现，由于改进的培训策略，即数据增强和优化技术，并且增加了模型大小，而不是建筑创新，因此绩效增长的很大一部分是由于改进的培训策略所致。因此，PointNet ++的全部潜力尚未探索。在这项工作中，我们通过对模型培训和扩展策略进行系统的研究来重新审视经典的PointNet ++，并提供两个主要贡献。首先，我们提出了一系列改进的培训策略，可显着提高PointNet ++的性能。例如，我们表明，在架构上没有任何变化的情况下，ScanObjectnn对象分类上的PointNet ++的总体准确性（OA）可以从77.9％提高到86.1％，甚至超过了最先进的pointmlp。其次，我们将倒置的残留瓶颈设计和可分离的MLP引入到PointNet ++中，以实现有效的模型缩放，并提出了PointNext，这是PointNet的下一个版本。可以在3D分类和分割任务上灵活地扩展PointNext，并优于最先进的方法。对于分类，PointNext在ScanObjectnn上达到87.7的总体精度，超过了顶点2.3％，而推理速度快10倍。对于语义细分，PointNext建立了新的最新性能，在S3DIS上的平均值为74.9％（6倍交叉验证），优于最近的点变压器。代码和型号可在https://github.com/guochengqian/pointNext上找到。

PointNet++ is one of the most influential neural architectures for point cloud understanding. Although the accuracy of PointNet++ has been largely surpassed by recent networks such as PointMLP and Point Transformer, we find that a large portion of the performance gain is due to improved training strategies, i.e. data augmentation and optimization techniques, and increased model sizes rather than architectural innovations. Thus, the full potential of PointNet++ has yet to be explored. In this work, we revisit the classical PointNet++ through a systematic study of model training and scaling strategies, and offer two major contributions. First, we propose a set of improved training strategies that significantly improve PointNet++ performance. For example, we show that, without any change in architecture, the overall accuracy (OA) of PointNet++ on ScanObjectNN object classification can be raised from 77.9% to 86.1%, even outperforming state-of-the-art PointMLP. Second, we introduce an inverted residual bottleneck design and separable MLPs into PointNet++ to enable efficient and effective model scaling and propose PointNeXt, the next version of PointNets. PointNeXt can be flexibly scaled up and outperforms state-of-the-art methods on both 3D classification and segmentation tasks. For classification, PointNeXt reaches an overall accuracy of 87.7 on ScanObjectNN, surpassing PointMLP by 2.3%, while being 10x faster in inference. For semantic segmentation, PointNeXt establishes a new state-of-the-art performance with 74.9% mean IoU on S3DIS (6-fold cross-validation), being superior to the recent Point Transformer. The code and models are available at https://github.com/guochengqian/pointnext.

下载PDF全文

下载文献需遵守相关版权规定

论文标题