JOURNAL ARTICLE

An Infrared and Visible Image Fusion Network Based on Res2Net and Multiscale Transformer

B.T.G. TanBin Yang

Year: 2025 Journal:   Sensors Vol: 25 (3)Pages: 791-791   Publisher: Multidisciplinary Digital Publishing Institute

Abstract

The aim of infrared and visible image fusion is to produce a composite image that can highlight the infrared targets and maintain plentiful detailed textures simultaneously. Despite the promising fusion performance of current deep-learning-based algorithms, most fusion algorithms highly depend on convolution operations, which limits their capability to represent long-range contextual information. To overcome this challenge, we design a novel infrared and visible image fusion network based on Res2Net and multiscale Transformer, called RMTFuse. Specifically, we devise a local feature extraction module based on Res2Net (LFE-RN) in which dense connections are adopted to reuse the information that might be lost in convolution operation and a global feature extraction module based on multiscale Transformer (GFE-MT) which is composed of a Transformer module and a global feature integration module (GFIM). The Transformer module extracts the coarse-to-fine semantic features of the source images, while GFIM is used to further aggregate the hierarchical features to strengthen contextual feature representations. Furthermore, we employ the pre-trained VGG-16 network to compute the loss of features with different depths. Massive experiments on mainstream datasets indicate that RMTFuse is superior to the state-of-the-art methods in both subjective and objective assessments.

Keywords:
Computer science Transformer Feature extraction Fusion Artificial intelligence Reuse Pattern recognition (psychology) Image fusion Convolutional neural network Image (mathematics) Engineering Voltage

Metrics

2
Cited By
7.03
FWCI (Field Weighted Citation Impact)
55
Refs
0.90
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Advanced Image Fusion Techniques
Physical Sciences →  Engineering →  Media Technology
Photoacoustic and Ultrasonic Imaging
Physical Sciences →  Engineering →  Biomedical Engineering
Remote-Sensing Image Classification
Physical Sciences →  Engineering →  Media Technology

Related Documents

JOURNAL ARTICLE

Infrared and Visible Image Fusion Based on Res2Net-Transformer Automatic Encoding and Decoding

Chunming WuW LiuXin Ma

Journal:   Computers, materials & continua/Computers, materials & continua (Print) Year: 2024 Vol: 79 (1)Pages: 1441-1461
JOURNAL ARTICLE

MCnet: Multiscale visible image and infrared image fusion network

Le SunYuhang LiMin ZhengZhaoyi ZhongYanchun Zhang

Journal:   Signal Processing Year: 2023 Vol: 208 Pages: 108996-108996
JOURNAL ARTICLE

Infrared and Visible Image Fusion Network Based on Transformer-CNN

玉 李

Journal:   Journal of Image and Signal Processing Year: 2025 Vol: 14 (04)Pages: 377-386
JOURNAL ARTICLE

Lightweight Infrared and Visible Image Fusion Based on Nested Connections and Res2Net

Peng YiXinyue TuQingqing Yang

Journal:   Applied Sciences Year: 2024 Vol: 14 (11)Pages: 4589-4589
© 2026 ScienceGate Book Chapters — All rights reserved.