Yantao Song, Yunli Lu, Lu Chen, Yimin Luo
Segmentation is an important prerequisite for developing modern healthcare systems, particularly for disease diagnosis and treatment planning. In medical image segmentation, the U-shaped architecture, commonly referred to as U-Net, has become the de facto standard and achieved remarkable success. However, owing to the intrinsic locality of convolution operations, U-Net is limited in explicitly modeling long-range dependencies. Recent Transformer-based models, designed for sequence-to-sequence prediction, have emerged as an alternative to traditional architectures, featuring innate global self-attention mechanisms. Unfortunately, they can suffer from limited localization ability due to a lack of sufficient low-level detail. To combine the merits of both Transformers and U-Net, in this paper we propose a novel dual-channel self-attention U-network that extracts features through two parallel channels, a CNN branch and a Transformer branch. In contrast to previous models, we introduce two hierarchical feature-fusion strategies operating along both the spatial and channel dimensions. Moreover, to further improve performance, we construct a loss function that dynamically adjusts its weights according to the output of each layer. Experimental results on five different datasets show that our method consistently outperforms state-of-the-art methods and generalizes well across various medical image modalities.
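The abstract describes fusing CNN and Transformer feature maps along both the channel and spatial dimensions, but gives no implementation details. The following NumPy sketch is therefore purely illustrative, not the authors' method: the function name `fuse_features`, the softmax channel gate, and the sigmoid spatial gate are all assumptions chosen to show what "fusion from two dimensions" could look like in code.

```python
import numpy as np

def fuse_features(cnn_feat, trans_feat):
    """Illustrative fusion of two feature maps of shape (C, H, W).

    Hypothetical sketch only: channel attention reweights each channel
    by its global average response; spatial attention gates each
    location by its mean activation across channels.
    """
    # Stack the two branches along the channel axis: (2C, H, W).
    concat = np.concatenate([cnn_feat, trans_feat], axis=0)

    # Channel-dimension fusion: softmax over per-channel global averages.
    channel_desc = concat.mean(axis=(1, 2))                   # (2C,)
    channel_w = np.exp(channel_desc) / np.exp(channel_desc).sum()
    channel_out = concat * channel_w[:, None, None]

    # Spatial-dimension fusion: sigmoid gate over per-location means.
    spatial_desc = concat.mean(axis=0)                        # (H, W)
    spatial_w = 1.0 / (1.0 + np.exp(-spatial_desc))
    spatial_out = concat * spatial_w[None, :, :]

    # Sum the two fused views; shape stays (2C, H, W).
    return channel_out + spatial_out

# Toy usage with random 4-channel, 8x8 feature maps from each branch.
cnn_feat = np.random.rand(4, 8, 8)
trans_feat = np.random.rand(4, 8, 8)
fused = fuse_features(cnn_feat, trans_feat)
print(fused.shape)
```

In a real dual-branch network these inputs would be intermediate activations from the CNN and Transformer encoders, and the fused map would feed the decoder at the corresponding resolution level.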