JOURNAL ARTICLE

UNETR++: Delving Into Efficient and Accurate 3D Medical Image Segmentation

Abdelrahman ShakerMuhammad MaazHanoona RasheedSalman KhanMing–Hsuan YangFahad Shahbaz Khan

Year: 2024 Journal:   IEEE Transactions on Medical Imaging Vol: 43 (9)Pages: 3377-3390   Publisher: Institute of Electrical and Electronics Engineers

Abstract

Owing to the success of transformer models, recent works study their applicability in 3D medical segmentation tasks. Within the transformer models, the self-attention mechanism is one of the main building blocks that strives to capture long-range dependencies, compared to the local convolutional-based design. However, the self-attention operation has quadratic complexity which proves to be a computational bottleneck, especially in volumetric medical imaging, where the inputs are 3D with numerous slices. In this paper, we propose a 3D medical image segmentation approach, named UNETR++, that offers both high-quality segmentation masks as well as efficiency in terms of parameters, compute cost, and inference speed. The core of our design is the introduction of a novel efficient paired attention (EPA) block that efficiently learns spatial and channel-wise discriminative features using a pair of inter-dependent branches based on spatial and channel attention. Our spatial attention formulation is efficient and has linear complexity with respect to the input. To enable communication between spatial and channel-focused branches, we share the weights of query and key mapping functions that provide a complimentary benefit (paired attention), while also reducing the complexity. Our extensive evaluations on five benchmarks, Synapse, BTCV, ACDC, BraTS, and Decathlon-Lung, reveal the effectiveness of our contributions in terms of both efficiency and accuracy. On Synapse, our UNETR++ sets a new state-of-the-art with a Dice Score of 87.2%, while significantly reducing parameters and FLOPs by over 71%, compared to the best method in the literature. Our code and models are available at: https://tinyurl.com/2p87x5xn.

Keywords:
Computer science Segmentation Bottleneck Artificial intelligence Image segmentation Computational complexity theory Discriminative model Inference FLOPS Dice Pattern recognition (psychology) Algorithm Mathematics Parallel computing

Metrics

252
Cited By
129.89
FWCI (Field Weighted Citation Impact)
52
Refs
1.00
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Advanced Neural Network Applications
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Medical Image Segmentation Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Radiomics and Machine Learning in Medical Imaging
Health Sciences →  Medicine →  Radiology, Nuclear Medicine and Imaging

Related Documents

JOURNAL ARTICLE

Lite Swin UNETR: A Lightweight Version of Swin UNETR for Efficient 3D Medical Image Segmentation

Junyoung ParkMinyoung ParkTaikyeong JeongSungwook Yu

Journal:   Journal of Electrical Engineering and Technology Year: 2025 Vol: 20 (8)Pages: 5455-5469
JOURNAL ARTICLE

UNETR: Transformers for 3D Medical Image Segmentation

Ali HatamizadehYucheng TangVishwesh NathDong YangAndriy MyronenkoBennett A. LandmanHolger R. RothDaguang Xu

Journal:   2022 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) Year: 2022 Pages: 1748-1758
JOURNAL ARTICLE

Slim UNETR++: A lightweight 3D medical image segmentation network for medical image analysis

Jiawei JinSen YangJigang TongKai ZhangZenghui Wang

Journal:   Medical & Biological Engineering & Computing Year: 2025 Vol: 63 (10)Pages: 3123-3137
© 2026 ScienceGate Book Chapters — All rights reserved.