了解图像检索重新排列：图形神经网络视角

论文标题

了解图像检索重新排列：图形神经网络视角

Understanding Image Retrieval Re-Ranking: A Graph Neural Network Perspective

论文作者

Zhang, Xuanmeng, Jiang, Minyue, Zheng, Zhedong, Tan, Xiao, Ding, Errui, Yang, Yi

论文摘要

重新排列的方法利用了高信心检索样品来完善检索结果，这些结果已被广泛用作图像检索任务的后处理工具。但是，我们注意到重新排列的一个主要缺陷，即高计算复杂性，这导致了现实世界应用的无法负担的时间成本。在本文中，我们重新访问了重新排列并证明可以将重新排列重新列为高平行性图形神经网络（GNN）函数。特别是，我们将传统的重新排行过程分为两个阶段，即检索高质量的画廊样本和更新功能。我们认为，第一阶段等于构建K-Nearest邻居图，而第二阶段可以看作是在图中传播消息。实际上，GNN只需要与连接的边缘有关顶点。由于图很少，因此我们可以有效地更新顶点功能。在Market-1501数据集上，我们使用一个K40M GPU加速了重新排列的处理，从89.2s到9.4ms，从而促进了实时后处理。同样，我们观察到，我们的方法在其他四个图像检索基准测试基准（即Veri-776，牛津5K，巴黎6K和1652年）上获得了可比甚至更好的检索结果，并具有有限的时间成本。我们的代码公开可用。

The re-ranking approach leverages high-confidence retrieved samples to refine retrieval results, which have been widely adopted as a post-processing tool for image retrieval tasks. However, we notice one main flaw of re-ranking, i.e., high computational complexity, which leads to an unaffordable time cost for real-world applications. In this paper, we revisit re-ranking and demonstrate that re-ranking can be reformulated as a high-parallelism Graph Neural Network (GNN) function. In particular, we divide the conventional re-ranking process into two phases, i.e., retrieving high-quality gallery samples and updating features. We argue that the first phase equals building the k-nearest neighbor graph, while the second phase can be viewed as spreading the message within the graph. In practice, GNN only needs to concern vertices with the connected edges. Since the graph is sparse, we can efficiently update the vertex features. On the Market-1501 dataset, we accelerate the re-ranking processing from 89.2s to 9.4ms with one K40m GPU, facilitating the real-time post-processing. Similarly, we observe that our method achieves comparable or even better retrieval results on the other four image retrieval benchmarks, i.e., VeRi-776, Oxford-5k, Paris-6k and University-1652, with limited time cost. Our code is publicly available.

下载PDF全文

下载文献需遵守相关版权规定

论文标题