通过完全卷积网络的自我监督学习

论文标题

通过完全卷积网络的自我监督学习

Self-supervised Learning with Fully Convolutional Networks

论文作者

Yang, Zhengeng, Yu, Hongshan, He, Yong, Mao, Zhi-Hong, Mian, Ajmal

论文摘要

尽管基于深度学习的方法在许多计算机视觉任务中取得了巨大的成功，但它们的性能取决于通常难以获得的大量密集注释的样本。在本文中，我们关注从未标记的数据进行语义细分的学习表示的问题。受两种基于补丁的方法的启发，我们通过将拼图拼图问题作为贴片分类过程制定并通过完全卷积的网络来解决它，从而开发了一种新颖的自我监督学习框架。通过学习用25个补丁解决拼图拼图问题，并将学习的功能传输到CityScapes数据集上的语义分割任务，我们比从随机值初始化的基线模型实现了5.8个百分点的改善。此外，实验表明，我们的自学学习方法可以应用于不同的数据集和模型。特别是，我们使用Pascal VOC2012数据集上的最新方法实现了竞争性能，使用较少的培训图像。

Although deep learning based methods have achieved great success in many computer vision tasks, their performance relies on a large number of densely annotated samples that are typically difficult to obtain. In this paper, we focus on the problem of learning representation from unlabeled data for semantic segmentation. Inspired by two patch-based methods, we develop a novel self-supervised learning framework by formulating the Jigsaw Puzzle problem as a patch-wise classification process and solving it with a fully convolutional network. By learning to solve a Jigsaw Puzzle problem with 25 patches and transferring the learned features to semantic segmentation task on Cityscapes dataset, we achieve a 5.8 percentage point improvement over the baseline model that initialized from random values. Moreover, experiments show that our self-supervised learning method can be applied to different datasets and models. In particular, we achieved competitive performance with the state-of-the-art methods on the PASCAL VOC2012 dataset using significant fewer training images.

下载PDF全文

下载文献需遵守相关版权规定

论文标题