JOURNAL ARTICLE

MACTFusion: Lightweight Cross Transformer for Adaptive Multimodal Medical Image Fusion

Xinyu XieXiaozhi ZhangXinglong TangJiaxi ZhaoDongping XiongLijun OuyangBin YangHong ZhouBingo Wing‐Kuen LingKok Lay Teo

Year: 2024 Journal:   IEEE Journal of Biomedical and Health Informatics Vol: 29 (5)Pages: 3317-3328   Publisher: Institute of Electrical and Electronics Engineers

Abstract

Multimodal medical image fusion aims to integrate complementary information from different modalities of medical images. Deep learning methods, especially recent vision Transformers, have effectively improved image fusion performance. However, there are limitations for Transformers in image fusion, such as lacks of local feature extraction and cross-modal feature interaction, resulting in insufficient multimodal feature extraction and integration. In addition, the computational cost of Transformers is higher. To address these challenges, in this work, we develop an adaptive cross-modal fusion strategy for unsupervised multimodal medical image fusion. Specifically, we propose a novel lightweight cross Transformer based on cross multi-axis attention mechanism. It includes cross-window attention and cross-grid attention to mine and integrate both local and global interactions of multimodal features. The cross Transformer is further guided by a spatial adaptation fusion module, which allows the model to focus on the most relevant information. Moreover, we design a special feature extraction module that combines multiple gradient residual dense convolutional and Transformer layers to obtain local features from coarse to fine and capture global features. The proposed strategy significantly boosts the fusion performance while minimizing computational costs. Extensive experiments, including clinical brain tumor image fusion, have shown that our model can achieve clearer texture details and better visual quality than other state-of-the-art fusion methods.

Keywords:
Computer science Artificial intelligence Feature extraction Image fusion Transformer Computer vision Pattern recognition (psychology) Fusion Multimodality Image (mathematics) Engineering

Metrics

29
Cited By
17.83
FWCI (Field Weighted Citation Impact)
48
Refs
0.99
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Advanced Image Fusion Techniques
Physical Sciences →  Engineering →  Media Technology
Remote-Sensing Image Classification
Physical Sciences →  Engineering →  Media Technology
Photoacoustic and Ultrasonic Imaging
Physical Sciences →  Engineering →  Biomedical Engineering
© 2026 ScienceGate Book Chapters — All rights reserved.