学习图像操纵检测的层次图表示

论文标题

学习图像操纵检测的层次图表示

Learning Hierarchical Graph Representation for Image Manipulation Detection

论文作者

Pan, Wenyan, Zhou, Zhili, Ling, Miaogen, Geng, Xin, Wu, Q. M. Jonathan

论文摘要

图像操纵检测的目的是识别和定位图像中的操纵区域。最近的方法主要采用复杂的卷积神经网络（CNN）来捕获图像中留下的篡改文物以定位操纵区域。但是，这些方法忽略了特征相关性，即操纵区域和非操纵区域之间的特征相关性，即特征不一致，从而导致检测性能较低。为了解决此问题，我们提出了一个分层图卷积网络（HGCN-NET），该网络由两个平行分支组成：骨干网络分支和分层图表示学习（HGRL）分支用于图像操纵检测。具体而言，给定图像的特征图由骨干网络分支提取，然后将特征图内的特征相关性建模为一组完全连接的图形，用于通过HGRL分支来学习层次结构图表示。学到的层次图表示可以充分捕获不同尺度的特征相关性，因此它为区分操纵和非操纵区域提供了很高的区分性。在四个公共数据集上进行的广泛实验表明，所提出的HGCN-NET不仅提供了有希望的检测准确性，而且在图像操纵检测的任务下，在各种常见的图像攻击下，与最先进的图像相比，还具有强大的鲁棒性。

The objective of image manipulation detection is to identify and locate the manipulated regions in the images. Recent approaches mostly adopt the sophisticated Convolutional Neural Networks (CNNs) to capture the tampering artifacts left in the images to locate the manipulated regions. However, these approaches ignore the feature correlations, i.e., feature inconsistencies, between manipulated regions and non-manipulated regions, leading to inferior detection performance. To address this issue, we propose a hierarchical Graph Convolutional Network (HGCN-Net), which consists of two parallel branches: the backbone network branch and the hierarchical graph representation learning (HGRL) branch for image manipulation detection. Specifically, the feature maps of a given image are extracted by the backbone network branch, and then the feature correlations within the feature maps are modeled as a set of fully-connected graphs for learning the hierarchical graph representation by the HGRL branch. The learned hierarchical graph representation can sufficiently capture the feature correlations across different scales, and thus it provides high discriminability for distinguishing manipulated and non-manipulated regions. Extensive experiments on four public datasets demonstrate that the proposed HGCN-Net not only provides promising detection accuracy, but also achieves strong robustness under a variety of common image attacks in the task of image manipulation detection, compared to the state-of-the-arts.

下载PDF全文

下载文献需遵守相关版权规定

论文标题