论文标题
PBRNET:锥体边界框的完善,以提高对象定位精度
PBRnet: Pyramidal Bounding Box Refinement to Improve Object Localization Accuracy
论文作者
论文摘要
最近,许多开发的对象检测器集中在粗到精细的框架上,该框架包含了几个阶段,这些阶段将提案分类和回归从粗粒细粒度为细粒度,并逐渐获得更准确的检测。多分辨率模型(例如特征金字塔网络(FPN))整合了不同级别的分辨率信息,并有效地改善了性能。先前的研究还表明,可以通过以下方式进一步改善本地化:1)使用更翻译变体的细粒度信息; 2)更专注于本地边界信息的地方区域。基于这些原则,我们设计了一种新型的边界精炼架构,以通过将粗到1的框架与特征金字塔结构相结合,称为金字塔界边界框精炼网络(PBRNET),该结构逐渐参数逐渐凝视对象的边界区域,并利用较低的级别特征图来提取菲尔本地信息,以提高预测的框架。在MS-Coco数据集上进行了广泛的实验。当添加到FPN或Libra R-CNN中时,PBRNet的性能增长大约3点$ MAP $。此外,通过将Cascade R-CNN视为粗到1的探测器,并通过PBRNet的回归剂代替其本地化分支,它可以使额外的性能提高1.5 $ MAP $,从而使总绩效提高到5点$ $ MAP $。
Many recently developed object detectors focused on coarse-to-fine framework which contains several stages that classify and regress proposals from coarse-grain to fine-grain, and obtains more accurate detection gradually. Multi-resolution models such as Feature Pyramid Network(FPN) integrate information of different levels of resolution and effectively improve the performance. Previous researches also have revealed that localization can be further improved by: 1) using fine-grained information which is more translational variant; 2) refining local areas which is more focused on local boundary information. Based on these principles, we designed a novel boundary refinement architecture to improve localization accuracy by combining coarse-to-fine framework with feature pyramid structure, named as Pyramidal Bounding Box Refinement network(PBRnet), which parameterizes gradually focused boundary areas of objects and leverages lower-level feature maps to extract finer local information when refining the predicted bounding boxes. Extensive experiments are performed on the MS-COCO dataset. The PBRnet brings a significant performance gains by roughly 3 point of $mAP$ when added to FPN or Libra R-CNN. Moreover, by treating Cascade R-CNN as a coarse-to-fine detector and replacing its localization branch by the regressor of PBRnet, it leads an extra performance improvement by 1.5 $mAP$, yielding a total performance boosting by as high as 5 point of $mAP$.