JOURNAL ARTICLE

Infrared and Visible Image Fusion with Overlapped Window Transformer

Xingwang LiuBemnet Wondimagegnehu MershaKaoru HirotaYaping Dai

Year: 2025 Journal:   Journal of Advanced Computational Intelligence and Intelligent Informatics Vol: 29 (4)Pages: 838-846   Publisher: Fuji Technology Press Ltd.

Abstract

An overlap window-based transformer is proposed for infrared and visible image fusion. A multi-head self-attention mechanism based on overlapping windows is designed. By introducing overlapping regions between windows, local features can interact across different windows, avoiding the discontinuity and information isolation issues caused by non-overlapping partitions. The proposed model is trained using an unsupervised loss function composed of three terms: pixel, gradient, and structural loss. With the end-to-end model and the unsupervised loss function, our method eliminates the need to manually design complex activity-level measurements and fusion strategies. Extensive experiments on the public TNO (grayscale) and RoadScene (RGB) datasets demonstrate that the proposed method achieves the expected long-distance dependency modeling capabilities when fusing infrared and visible images, as well as the positive results in both qualitative and quantitative evaluations.

Keywords:
Computer science Artificial intelligence Grayscale Fusion Transformer Computer vision Pixel Window (computing) Discontinuity (linguistics) Window function Pattern recognition (psychology) Image fusion Image (mathematics)

Metrics

0
Cited By
0.00
FWCI (Field Weighted Citation Impact)
27
Refs
0.36
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Topics

Advanced Image Fusion Techniques
Physical Sciences →  Engineering →  Media Technology
Remote-Sensing Image Classification
Physical Sciences →  Engineering →  Media Technology
Image Enhancement Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
© 2026 ScienceGate Book Chapters — All rights reserved.