论文标题
关于深哈希方法的调查
A Survey on Deep Hashing Methods
论文作者
论文摘要
最近的邻居搜索旨在获取数据库中的样本,从它们到查询最小的距离,这是一系列字段(包括计算机视觉和数据挖掘)的基本任务。哈希是其计算和存储效率最广泛使用的方法之一。随着深度学习的发展,深度哈希方法比传统方法更有优势。在这项调查中,我们细节研究了当前的深层哈希算法,包括深度监督的哈希和无监督的哈希。具体而言,我们根据如何根据如何测量学习的哈希码的相似性来将深度监督的哈希方法分类为成对方法,基于排名的方法,点式方法以及量化。此外,深度无监督的散列被归类为基于相似性重建的方法,基于伪标签的方法和基于其语义学习方式的基于伪标签的方法和无预测的基于学习的学习方法。我们还介绍了三个相关的重要主题,包括半监督的深度散列,域名深层散列和多模式深哈希。同时,我们提出了一些常用的公共数据集以及测量深哈希算法的性能的计划。最后,我们讨论了一些潜在的研究方向。
Nearest neighbor search aims to obtain the samples in the database with the smallest distances from them to the queries, which is a basic task in a range of fields, including computer vision and data mining. Hashing is one of the most widely used methods for its computational and storage efficiency. With the development of deep learning, deep hashing methods show more advantages than traditional methods. In this survey, we detailedly investigate current deep hashing algorithms including deep supervised hashing and deep unsupervised hashing. Specifically, we categorize deep supervised hashing methods into pairwise methods, ranking-based methods, pointwise methods as well as quantization according to how measuring the similarities of the learned hash codes. Moreover, deep unsupervised hashing is categorized into similarity reconstruction-based methods, pseudo-label-based methods and prediction-free self-supervised learning-based methods based on their semantic learning manners. We also introduce three related important topics including semi-supervised deep hashing, domain adaption deep hashing and multi-modal deep hashing. Meanwhile, we present some commonly used public datasets and the scheme to measure the performance of deep hashing algorithms. Finally, we discuss some potential research directions in conclusion.