Most existing image fusion methods rely on an adversarial learning game to fuse infrared and visible images. However, such a single adversarial mechanism tends to ignore global contextual information in the image fusion task. To this end, this paper proposes a CNN-Transformer dual-process-based generative adversarial network (CTDPGAN) to fuse infrared and visible images. In the generator, a dual-process module composed of a CNN block and a Swin-Transformer block is proposed. The channel filter and spatial filter in the CNN block adaptively extract complementary information from the images of different modalities while preserving the shallow features of the source images. The Swin-Transformer Block (STRB) is designed to establish local attention by partitioning features into non-overlapping windows and then to bridge global attention through cross-window interaction. In addition, we introduce adversarial learning into the training process: dual-channel Transformer discriminators are designed to improve the discrimination of the fused image. Thus, the fused image learns the distribution of global contextual information from the source images and balances the competing visible and infrared domains. Moreover, we introduce the concepts of primary and auxiliary features into the structural similarity loss function and the spatial frequency loss function, which enables the generator to produce a fused image that retains thermal radiation information as well as rich detail information. Finally, experimental results demonstrate that, in both subjective and objective assessments, our model produces results comparable or superior to state-of-the-art image fusion methods.
Dongyu Rao, Tianyang Xu, Xiao-Jun Wu
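As a concrete illustration of the primary/auxiliary loss weighting described above, the following is a minimal PyTorch sketch of a combined structural similarity and spatial frequency loss. The spatial_frequency function follows the standard SF definition; the primary_auxiliary_loss function, the weight alpha, the choice of which modality is primary for each term, and the use of pytorch_msssim's ssim are illustrative assumptions, not the paper's exact formulation.

```python
import torch
from pytorch_msssim import ssim  # third-party SSIM (pip install pytorch-msssim)


def spatial_frequency(img: torch.Tensor) -> torch.Tensor:
    """Per-image spatial frequency of a (B, C, H, W) batch.

    SF = sqrt(RF^2 + CF^2), where RF and CF are the root-mean-square
    horizontal and vertical first-order differences (standard definition).
    """
    row_diff = img[:, :, :, 1:] - img[:, :, :, :-1]  # horizontal differences
    col_diff = img[:, :, 1:, :] - img[:, :, :-1, :]  # vertical differences
    rf = torch.sqrt((row_diff ** 2).mean(dim=(1, 2, 3)))
    cf = torch.sqrt((col_diff ** 2).mean(dim=(1, 2, 3)))
    return torch.sqrt(rf ** 2 + cf ** 2)


def primary_auxiliary_loss(fused, ir, vis, alpha=0.7):
    """Hypothetical primary/auxiliary weighting of SSIM and SF terms.

    Assumption: the infrared image is treated as primary for structural
    similarity (thermal radiation) and the visible image as primary for
    spatial frequency (texture detail); alpha > 0.5 is the primary weight
    and (1 - alpha) the auxiliary one. Inputs are assumed to lie in [0, 1].
    """
    l_ssim = (alpha * (1 - ssim(fused, ir, data_range=1.0))
              + (1 - alpha) * (1 - ssim(fused, vis, data_range=1.0)))
    sf_f, sf_ir, sf_vis = map(spatial_frequency, (fused, ir, vis))
    l_sf = (alpha * (sf_vis - sf_f).abs().mean()
            + (1 - alpha) * (sf_ir - sf_f).abs().mean())
    return l_ssim + l_sf
```

For example, calling primary_auxiliary_loss(fused, ir, vis) on normalized grayscale batches yields a scalar that a generator could minimize alongside its adversarial terms.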