学习零拍识别的可簇视觉功能

论文标题

学习零拍识别的可簇视觉功能

Learning Clusterable Visual Features for Zero-Shot Recognition

论文作者

Xu, Jingyi, Shu, Zhixin, Samaras, Dimitris

论文摘要

在零拍学习（ZSL）中，有条件的发电机已被广泛用于生成其他训练功能。然后，这些功能可用于训练分类器进行测试数据。但是，某些测试数据被认为是“硬”，因为它们靠近决策边界，并且容易出现错误分类，从而导致ZSL的性能退化。在本文中，我们建议学习有关ZSL问题的簇功能。使用有条件的变异自动编码器（CVAE）作为特征生成器，我们将原始功能投射到由辅助分类损失监督的新功能空间。为了进一步提高凝聚力，我们使用高斯相似性损失微调了特征。可簇的视觉特征不仅更适合CVAE重建，而且更可分离，从而提高了分类精度。此外，我们引入高斯噪声，以扩大生成特征的类内差异，这有助于提高分类器的鲁棒性。我们对太阳，幼崽和AWA2数据集的实验表现出比以前最先进的ZSL结果的一致性提高。除了对零射击分类的有效性外，实验还表明，提高功能凝聚性的方法也有益于射击学习算法。

In zero-shot learning (ZSL), conditional generators have been widely used to generate additional training features. These features can then be used to train the classifiers for testing data. However, some testing data are considered "hard" as they lie close to the decision boundaries and are prone to misclassification, leading to performance degradation for ZSL. In this paper, we propose to learn clusterable features for ZSL problems. Using a Conditional Variational Autoencoder (CVAE) as the feature generator, we project the original features to a new feature space supervised by an auxiliary classification loss. To further increase clusterability, we fine-tune the features using Gaussian similarity loss. The clusterable visual features are not only more suitable for CVAE reconstruction but are also more separable which improves classification accuracy. Moreover, we introduce Gaussian noise to enlarge the intra-class variance of the generated features, which helps to improve the classifier's robustness. Our experiments on SUN,CUB, and AWA2 datasets show consistent improvement over previous state-of-the-art ZSL results by a large margin. In addition to its effectiveness on zero-shot classification, experiments show that our method to increase feature clusterability benefits few-shot learning algorithms as well.

下载PDF全文

下载文献需遵守相关版权规定

论文标题