基于小组的深层共享特征学习，用于细粒度的图像分类

论文标题

基于小组的深层共享特征学习，用于细粒度的图像分类

Group Based Deep Shared Feature Learning for Fine-grained Image Classification

论文作者

Li, Xuelu, Monga, Vishal

论文摘要

细粒度的图像分类已成为一个重大挑战，因为此类图像中的对象具有较小的阶层间视觉差异，但姿势，照明和观点等的差异很大。大多数现有工作都集中在通过深层网络体系结构进行高度定制的特征提取，这些网络结构已显示出可提供艺术状态的状态。鉴于细粒度分类中不同类别的图像具有关键的感兴趣的特征，因此我们提出了一种新的深网架构，该架构明确地模拟了共享特征并消除其效果以实现增强的分类结果。我们对共享功能的建模基于一个新的基于组的学习，其中现有类分为组，并发现了多个共享特征模式（学习）。我们将基于此框架组的深层共享功能学习（GSFL）称为GSFL-NET。具体而言，所提出的GSFL-NET开发了一种专门设计的自动编码器，该自动编码器受到新提出的特征表达式损失的约束，将一组特征分解到其组成部分共享和歧视性组件中。在推断期间，仅使用判别特征组件来完成分类任务。我们专业的自动编码器的一个关键好处是它具有多功能性，可以与最先进的精细元素提取模型结合使用，并与它们一起培训以直接提高其性能。基准数据集上的实验表明，GSFL-NET可以通过更容易解释的体系结构提高对技术的分类精度。

Fine-grained image classification has emerged as a significant challenge because objects in such images have small inter-class visual differences but with large variations in pose, lighting, and viewpoints, etc. Most existing work focuses on highly customized feature extraction via deep network architectures which have been shown to deliver state of the art performance. Given that images from distinct classes in fine-grained classification share significant features of interest, we present a new deep network architecture that explicitly models shared features and removes their effect to achieve enhanced classification results. Our modeling of shared features is based on a new group based learning wherein existing classes are divided into groups and multiple shared feature patterns are discovered (learned). We call this framework Group based deep Shared Feature Learning (GSFL) and the resulting learned network as GSFL-Net. Specifically, the proposed GSFL-Net develops a specially designed autoencoder which is constrained by a newly proposed Feature Expression Loss to decompose a set of features into their constituent shared and discriminative components. During inference, only the discriminative feature component is used to accomplish the classification task. A key benefit of our specialized autoencoder is that it is versatile and can be combined with state-of-the-art fine-grained feature extraction models and trained together with them to improve their performance directly. Experiments on benchmark datasets show that GSFL-Net can enhance classification accuracy over the state of the art with a more interpretable architecture.

下载PDF全文

下载文献需遵守相关版权规定

论文标题