一击对象检测而无需微调

论文标题

一击对象检测而无需微调

One-Shot Object Detection without Fine-Tuning

论文作者

Li, Xiang, Zhang, Lin, Chen, Yau Pun, Tai, Yu-Wing, Tang, Chi-Keung

论文摘要

大规模数据集彻底改变了对象检测，但它们的对象类别仍然可以说是非常有限的。在本文中，我们试图通过解决单次对象检测问题来丰富此类类别，在该问题中，学习不见一类的带注释的培训示例的数量仅限于一个。我们引入了一个两阶段模型，该模型由第一阶段匹配的FCOS网络和第二阶段结构感知关系模块组成，该模块的组合将公制学习与无锚固速度更快的R-CNN风格检测管道集成在一起，最终消除了对支持图像进行微调的需求。我们还提出了新颖的培训策略，可有效提高检测性能。进行了广泛的定量和定性评估，我们的方法在多个数据集上始终超过最先进的单弹性性能。

Deep learning has revolutionized object detection thanks to large-scale datasets, but their object categories are still arguably very limited. In this paper, we attempt to enrich such categories by addressing the one-shot object detection problem, where the number of annotated training examples for learning an unseen class is limited to one. We introduce a two-stage model consisting of a first stage Matching-FCOS network and a second stage Structure-Aware Relation Module, the combination of which integrates metric learning with an anchor-free Faster R-CNN-style detection pipeline, eventually eliminating the need to fine-tune on the support images. We also propose novel training strategies that effectively improve detection performance. Extensive quantitative and qualitative evaluations were performed and our method exceeds the state-of-the-art one-shot performance consistently on multiple datasets.

下载PDF全文

下载文献需遵守相关版权规定

论文标题