JOURNAL ARTICLE

Hierarchical Multi-Scale Enhanced Transformer for Medical Image Segmentation

Yantao SongYunli LuLu ChenYimin Luo

Year: 2024 Journal:   IEEE Journal of Biomedical and Health Informatics Vol: 29 (12)Pages: 8917-8927   Publisher: Institute of Electrical and Electronics Engineers

Abstract

Segmentation is an important prerequisite for developing model healthcare systems, particularly for disease diagnosis and treatment planning. In the field of medical image segmentation, the U-shaped architecture, commonly referred to as U-Net, has emerged as the de facto standard and achieved remarkable success. However, due to the intrinsic locality of convolution operations, U-Net generally demonstrates limitations in explicitly modeling long-range dependency. Recent transformer-based models, designed for sequence-to-sequence prediction, have emerged as an alternative to traditional architectures, featuring innate global self-attention mechanisms. Unfortunately, they may sometimes suffer from limited localization abilities due to a lack of sufficient low-level details. To merit both Transformers and U-Net, in this paper, we propose a novel two-channel self-attention mechanism U-network, which performs feature extraction from two channels, CNN and Transformer, respectively. Compared to previous models, we propose two hierarchical feature fusion strategies from both spatial and channel dimensions. Moreover, to further promote the model performance, a loss function that can dynamically adjust the weights according to the output of each layer is constructed. Experimental results on five different datasets show that our method performs consistently outperforms state-of-the-art methods, and it also has an outstanding generalization ability to various medical image modalities.

Keywords:
Image segmentation Computer science Artificial intelligence Computer vision Segmentation Transformer Scale (ratio) Scale-space segmentation Pattern recognition (psychology) Engineering Cartography Voltage Electrical engineering Geography

Metrics

3
Cited By
1.59
FWCI (Field Weighted Citation Impact)
19
Refs
0.77
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Medical Image Segmentation Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Brain Tumor Detection and Classification
Life Sciences →  Neuroscience →  Neurology
© 2026 ScienceGate Book Chapters — All rights reserved.