论文标题
一击对象检测而无需微调
One-Shot Object Detection without Fine-Tuning
论文作者
论文摘要
大规模数据集彻底改变了对象检测,但它们的对象类别仍然可以说是非常有限的。在本文中,我们试图通过解决单次对象检测问题来丰富此类类别,在该问题中,学习不见一类的带注释的培训示例的数量仅限于一个。我们引入了一个两阶段模型,该模型由第一阶段匹配的FCOS网络和第二阶段结构感知关系模块组成,该模块的组合将公制学习与无锚固速度更快的R-CNN风格检测管道集成在一起,最终消除了对支持图像进行微调的需求。我们还提出了新颖的培训策略,可有效提高检测性能。进行了广泛的定量和定性评估,我们的方法在多个数据集上始终超过最先进的单弹性性能。
Deep learning has revolutionized object detection thanks to large-scale datasets, but their object categories are still arguably very limited. In this paper, we attempt to enrich such categories by addressing the one-shot object detection problem, where the number of annotated training examples for learning an unseen class is limited to one. We introduce a two-stage model consisting of a first stage Matching-FCOS network and a second stage Structure-Aware Relation Module, the combination of which integrates metric learning with an anchor-free Faster R-CNN-style detection pipeline, eventually eliminating the need to fine-tune on the support images. We also propose novel training strategies that effectively improve detection performance. Extensive quantitative and qualitative evaluations were performed and our method exceeds the state-of-the-art one-shot performance consistently on multiple datasets.