JOURNAL ARTICLE

CTDPGAN: Infrared and Visible Image Fusion Using CNN-Transformer Dual-Process-Based Generative Adversarial Network

Abstract

Most existing image fusion methods rely on an adversarial learning game to fuse infrared and visible images. However, such a single adversarial mechanism makes it easy for the fusion task to overlook global contextual information. To this end, this paper proposes a CNN-Transformer dual-process-based generative adversarial network (CTDPGAN) to fuse infrared and visible images. In the generator, a dual-process-based module composed of a CNN block and a Swin-Transformer block is proposed. The channel filter and spatial filter in the CNN block adaptively extract additional complementary information from images of various modalities while preserving the shallow features of the source images. The Swin-Transformer block (STRB) is designed to establish local attention by dividing the input into non-overlapping windows and then to bridge global attention through interaction between windows. In addition, we introduce generative adversarial learning into the training process: dual-channel transformer discriminators are designed to improve the discriminative ability with respect to the fused image. Thus, the fused image learns the distribution of global contextual information from the source images and retains competitive information from the visible-light and infrared domains in a more balanced manner. Moreover, we introduce the concepts of primary and auxiliary features into the structural similarity loss function and the spatial frequency loss function, which enables the generator to produce a fused image that retains thermal radiation information and rich detail. Finally, the experimental results demonstrate that, in both subjective and objective assessments, our model produces results that are comparable or superior to those of state-of-the-art image fusion methods.
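The window mechanism mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names `window_partition` and `cyclic_shift` are hypothetical, the feature map is a toy 4 x 4 grid of integers, and the cyclic shift is the mechanism used by the original Swin Transformer, assumed here because the abstract does not state how the windows interact.

```python
def window_partition(feature_map, window_size):
    """Split an H x W grid (a list of rows) into non-overlapping windows.

    Returns the windows in row-major order; each window is a list of
    window_size rows holding window_size values each.
    """
    height = len(feature_map)
    width = len(feature_map[0])
    if height % window_size or width % window_size:
        raise ValueError("height and width must be multiples of window_size")
    windows = []
    for top in range(0, height, window_size):
        for left in range(0, width, window_size):
            windows.append([row[left:left + window_size]
                            for row in feature_map[top:top + window_size]])
    return windows


def cyclic_shift(feature_map, shift):
    """Roll the grid up and to the left by `shift` positions, wrapping around.

    Assumption: this mirrors the shifted-window step of the original
    Swin Transformer; the abstract does not confirm CTDPGAN uses it.
    """
    rolled_rows = feature_map[shift:] + feature_map[:shift]
    return [row[shift:] + row[:shift] for row in rolled_rows]


# Toy 4 x 4 feature map holding the values 0..15.
grid = [[r * 4 + c for c in range(4)] for r in range(4)]

# Local attention would be computed inside each of these 2 x 2 windows.
regular = window_partition(grid, 2)

# After shifting by half a window, each new window mixes positions that
# belonged to different windows before the shift.
shifted = window_partition(cyclic_shift(grid, 1), 2)

print(regular[0])  # [[0, 1], [4, 5]]
print(shifted[0])  # [[5, 6], [9, 10]]
```

In this toy case the first shifted window holds one value from each of the four regular windows. That is the sense in which alternating regular and shifted partitions lets information spread beyond a single window.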

Keywords:
Computer science; Discriminative model; Artificial intelligence; Transformer; Computer vision; Pattern recognition (psychology); Feature learning; Engineering

Metrics

Cited By: 1
FWCI (Field Weighted Citation Impact): 0.22
Refs: 39
Citation Normalized Percentile: 0.51

Topics

Advanced Image Fusion Techniques
Physical Sciences →  Engineering →  Media Technology
Image Enhancement Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Image and Signal Denoising Methods
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition

Related Documents

JOURNAL ARTICLE

Dual Generative Adversarial Network for Infrared and Visible Image Fusion

Zhi Wang, Ao Dong

Journal: Journal of Computing and Electronic Information Management, Year: 2025, Vol: 16 (1), Pages: 55-59
JOURNAL ARTICLE

DFPGAN: Dual fusion path generative adversarial network for infrared and visible image fusion

Yi Shi, Junjie Li, Xuesong Yuan

Journal: Infrared Physics & Technology, Year: 2021, Vol: 119, Pages: 103947-103947
JOURNAL ARTICLE

Infrared and Visible Image Fusion Network Based on Transformer-CNN

玉 李

Journal: Journal of Image and Signal Processing, Year: 2025, Vol: 14 (04), Pages: 377-386
JOURNAL ARTICLE

TGFuse: An Infrared and Visible Image Fusion Approach Based on Transformer and Generative Adversarial Network

Dongyu Rao, Tianyang Xu, Xiao‐Jun Wu

Journal: IEEE Transactions on Image Processing, Year: 2023, Vol: PP, Pages: 1-1
JOURNAL ARTICLE

Infrared and visible image fusion using two-layer generative adversarial network

Lei Chen, Jun Han, Feng Tian

Journal: Journal of Intelligent & Fuzzy Systems, Year: 2021, Vol: 40 (6), Pages: 11897-11913