论文标题

Swinfuse:用于红外和可见图像的残留Swin变压器融合网络

SwinFuse: A Residual Swin Transformer Fusion Network for Infrared and Visible Images

论文作者

Wang, Zhishe, Chen, Yanlin, Shao, Wenyu, Li, Hui, Zhang, Lei

论文摘要

现有的深度学习融合方法主要集中在卷积神经网络上,很少尝试使用变压器进行尝试。同时,卷积操作是图像和卷积内核之间与内容无关的相互作用,它可能会失去一些重要的环境并进一步限制融合性能。为此,我们为红外且可见的图像提供了一个简单而强的融合基线,即\ textIt {残留的Swin Transformer融合网络},称为Swinfuse。我们的Swinfuse包括三个部分:全局特征提取,融合层和特征重建。特别是,我们建立了一个完全注意的特征,该特征编码骨干,以建模远程依赖性,该依赖性是纯变压器网络,与卷积神经网络相比具有更强的表示能力。此外,我们设计了一种基于$ l_ {1} $的新型功能融合策略 - 序列矩阵的规范,并从行和列矢量尺寸中测量相应的活动水平,可以很好地保留竞争性红外亮度和独特的可见细节。最后,我们通过主观观察和客观比较在三个不同数据集上使用九种最先进的传统和深度学习方法作证,实验结果表明,拟议的Swinfuse具有出人意料的融合性能,具有强大的概括能力和竞争性的计算效率。该代码将在https://github.com/zhishe-wang/swinfuse上找到。

The existing deep learning fusion methods mainly concentrate on the convolutional neural networks, and few attempts are made with transformer. Meanwhile, the convolutional operation is a content-independent interaction between the image and convolution kernel, which may lose some important contexts and further limit fusion performance. Towards this end, we present a simple and strong fusion baseline for infrared and visible images, namely\textit{ Residual Swin Transformer Fusion Network}, termed as SwinFuse. Our SwinFuse includes three parts: the global feature extraction, fusion layer and feature reconstruction. In particular, we build a fully attentional feature encoding backbone to model the long-range dependency, which is a pure transformer network and has a stronger representation ability compared with the convolutional neural networks. Moreover, we design a novel feature fusion strategy based on $L_{1}$-norm for sequence matrices, and measure the corresponding activity levels from row and column vector dimensions, which can well retain competitive infrared brightness and distinct visible details. Finally, we testify our SwinFuse with nine state-of-the-art traditional and deep learning methods on three different datasets through subjective observations and objective comparisons, and the experimental results manifest that the proposed SwinFuse obtains surprising fusion performance with strong generalization ability and competitive computational efficiency. The code will be available at https://github.com/Zhishe-Wang/SwinFuse.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源