使用小波来分析图像分类数据集中的相似性

论文标题

使用小波来分析图像分类数据集中的相似性

Using Wavelets to Analyze Similarities in Image-Classification Datasets

论文作者

Yousefzadeh, Roozbeh

论文摘要

深度学习图像分类器通常依靠巨大的训练集，他们的培训过程可以描述为学习训练图像之间的相似性和差异。但是，通常不会从这个角度研究大型训练集中的图像，并且通常会忽略图像之间的相似性和差异。这是由于缺乏快速有效的计算方法来分析这些数据集的内容。一些研究旨在确定有影响力和冗余的训练图像，但是这种方法需要已经在整个训练集中进行培训的模型。在这里，使用图像处理和数值分析工具，我们开发了一种实用而快速的方法来分析图像分类数据集中的相似性。我们表明，在训练模型之前，这种分析可以提供有关数据集和当前的分类任务的宝贵见解。我们的方法使用图像和其他数值分析工具的小波分解，而无需进行预训练的模型。有趣的是，我们获得的结果证实了文献中先前的结果，这些结果使用预训练的CNN分析了相似性。我们表明，可以在几秒钟内确定标准数据集（例如CIFAR）中的类似图像，与文献中的替代方法相比，这是显着的加速。通过消除计算速度障碍，获得有关数据集内容及其训练的模型的新见解变得可行。我们表明，训练和测试图像之间的相似性可能会提供有关模型概括的见解。最后，我们研究了图像与受过训练模型的决策边界有关的相似性。

Deep learning image classifiers usually rely on huge training sets and their training process can be described as learning the similarities and differences among training images. But, images in large training sets are not usually studied from this perspective and fine-level similarities and differences among images is usually overlooked. This is due to lack of fast and efficient computational methods to analyze the contents of these datasets. Some studies aim to identify the influential and redundant training images, but such methods require a model that is already trained on the entire training set. Here, using image processing and numerical analysis tools we develop a practical and fast method to analyze the similarities in image classification datasets. We show that such analysis can provide valuable insights about the datasets and the classification task at hand, prior to training a model. Our method uses wavelet decomposition of images and other numerical analysis tools, with no need for a pre-trained model. Interestingly, the results we obtain corroborate the previous results in the literature that analyzed the similarities using pre-trained CNNs. We show that similar images in standard datasets (such as CIFAR) can be identified in a few seconds, a significant speed-up compared to alternative methods in the literature. By removing the computational speed obstacle, it becomes practical to gain new insights about the contents of datasets and the models trained on them. We show that similarities between training and testing images may provide insights about the generalization of models. Finally, we investigate the similarities between images in relation to decision boundaries of a trained model.

下载PDF全文

下载文献需遵守相关版权规定

论文标题