论文标题

一击对象检测而无需微调

One-Shot Object Detection without Fine-Tuning

论文作者

Li, Xiang, Zhang, Lin, Chen, Yau Pun, Tai, Yu-Wing, Tang, Chi-Keung

论文摘要

大规模数据集彻底改变了对象检测,但它们的对象类别仍然可以说是非常有限的。在本文中,我们试图通过解决单次对象检测问题来丰富此类类别,在该问题中,学习不见一类的带注释的培训示例的数量仅限于一个。我们引入了一个两阶段模型,该模型由第一阶段匹配的FCOS网络和第二阶段结构感知关系模块组成,该模块的组合将公制学习与无锚固速度更快的R-CNN风格检测管道集成在一起,最终消除了对支持图像进行微调的需求。我们还提出了新颖的培训策略,可有效提高检测性能。进行了广泛的定量和定性评估,我们的方法在多个数据集上始终超过最先进的单弹性性能。

Deep learning has revolutionized object detection thanks to large-scale datasets, but their object categories are still arguably very limited. In this paper, we attempt to enrich such categories by addressing the one-shot object detection problem, where the number of annotated training examples for learning an unseen class is limited to one. We introduce a two-stage model consisting of a first stage Matching-FCOS network and a second stage Structure-Aware Relation Module, the combination of which integrates metric learning with an anchor-free Faster R-CNN-style detection pipeline, eventually eliminating the need to fine-tune on the support images. We also propose novel training strategies that effectively improve detection performance. Extensive quantitative and qualitative evaluations were performed and our method exceeds the state-of-the-art one-shot performance consistently on multiple datasets.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源