Liang Zhang, Yueqiu Jiang, Wei Yang, B. Liu
Infrared-visible image fusion (IVIF) is an important part of multimodal image fusion (MMF). The goal is to combine complementary information from infrared and visible sources into robust, richly detailed fused images that support better scene understanding. However, most existing fusion methods based on convolutional neural networks extract cross-modal local features without fully exploiting long-range contextual information, which degrades performance, especially in complex scenes. To address this issue, we propose TCTFusion, a three-branch cross-modal transformer for visible-infrared image fusion. The model consists of a shallow feature module (SFM), a frequency decomposition module (FDM), and an information aggregation module (IAM). The three branches receive the infrared, visible, and concatenated images, respectively. The SFM extracts cross-modal shallow features using residual connections with shared weights. The FDM then captures low-frequency global information across modalities and high-frequency local information within each modality. The IAM aggregates complementary cross-modal features, enabling full interaction between the modalities, and the decoder generates the fused image. Additionally, we introduce a pixel loss and a structural loss that significantly improve the model's overall performance. Extensive experiments on mainstream datasets demonstrate that TCTFusion outperforms other state-of-the-art methods in both qualitative and quantitative evaluations.
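To make the three-branch data flow described in the abstract concrete, the following is a minimal, hypothetical PyTorch sketch of the pipeline (SFM with shared weights, an FDM with low- and high-frequency paths, an IAM, and a decoder). All layer choices, channel sizes, and the use of plain convolutions in place of the transformer blocks are assumptions for illustration, not the authors' implementation.

```python
import torch
import torch.nn as nn

class TCTFusionSketch(nn.Module):
    """Illustrative skeleton of the three-branch layout; not the published model."""
    def __init__(self, ch=16):
        super().__init__()
        # SFM: shared-weight shallow extractor with a residual connection,
        # applied to both the infrared and visible branches.
        self.sfm_in = nn.Conv2d(1, ch, 3, padding=1)
        self.sfm_res = nn.Conv2d(ch, ch, 3, padding=1)
        # Third branch: receives the channel-concatenated image pair.
        self.cat_in = nn.Conv2d(2, ch, 3, padding=1)
        # FDM placeholders: a low-frequency (cross-modal, global) path and a
        # high-frequency (per-modality, local) path; the real FDM uses transformers.
        self.fdm_low = nn.Conv2d(ch, ch, 3, padding=1)
        self.fdm_high = nn.Conv2d(ch, ch, 3, padding=1)
        # IAM aggregates the complementary features; the decoder produces the fused image.
        self.iam = nn.Conv2d(3 * ch, ch, 1)
        self.decoder = nn.Conv2d(ch, 1, 3, padding=1)

    def forward(self, ir, vis):
        # Shallow features with shared weights and residual connections.
        f_ir = self.sfm_in(ir)
        f_ir = f_ir + self.sfm_res(f_ir)
        f_vis = self.sfm_in(vis)
        f_vis = f_vis + self.sfm_res(f_vis)
        # Concatenated-image branch feeds the low-frequency global path.
        f_cat = self.cat_in(torch.cat([ir, vis], dim=1))
        low = self.fdm_low(f_cat)
        # High-frequency local information within each modality.
        high_ir = self.fdm_high(f_ir)
        high_vis = self.fdm_high(f_vis)
        # Aggregate cross-modal features and decode to a single-channel fused image.
        fused = self.iam(torch.cat([low, high_ir, high_vis], dim=1))
        return torch.sigmoid(self.decoder(fused))

# Usage: single-channel IR and visible inputs of matching spatial size.
ir = torch.rand(1, 1, 128, 128)
vis = torch.rand(1, 1, 128, 128)
print(TCTFusionSketch()(ir, vis).shape)  # torch.Size([1, 1, 128, 128])
```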