JOURNAL ARTICLE

TransAttUnet: Multi-Level Attention-Guided U-Net With Transformer for Medical Image Segmentation

Bingzhi ChenYishu LiuZheng ZhangGuangming LuAdams Wai‐Kin Kong

Year: 2023 Journal:   IEEE Transactions on Emerging Topics in Computational Intelligence Vol: 8 (1)Pages: 55-68   Publisher: Institute of Electrical and Electronics Engineers

Abstract

Accurate segmentation of organs or lesions from medical images is crucial for reliable diagnosis of diseases and organ morphometry. In recent years, convolutional encoder-decoder solutions have achieved substantial progress in the field of automatic medical image segmentation. Due to the inherent bias in the convolution operations, prior models mainly focus on local visual cues formed by the neighboring pixels, but fail to fully model the long-range contextual dependencies. In this article, we propose a novel Transformer-based Attention Guided Network called TransAttUnet , in which the multi-level guided attention and multi-scale skip connection are designed to jointly enhance the performance of the semantical segmentation architecture. Inspired by Transformer, the self-aware attention (SAA) module with Transformer Self Attention (TSA) and Global Spatial Attention (GSA) is incorporated into TransAttUnet to effectively learn the non-local interactions among encoder features. Moreover, we also use additional multi-scale skip connections between decoder blocks to aggregate the upsampled features with different semantic scales. In this way, the representation ability of multi-scale context information is strengthened to generate discriminative features. Benefitting from these complementary components, the proposed TransAttUnet can effectively alleviate the loss of fine details caused by the stacking of convolution layers and the consecutive sampling operations, finally improving the segmentation quality of medical images. Extensive experiments were conducted on multiple medical image segmentation datasets from various imaging modalities, which demonstrate that the proposed method consistently outperforms the existing state-of-the-art methods.

Keywords:
Computer science Segmentation Artificial intelligence Encoder Discriminative model Transformer Image segmentation Pixel Pattern recognition (psychology) Computer vision

Metrics

301
Cited By
54.04
FWCI (Field Weighted Citation Impact)
47
Refs
1.00
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Advanced Neural Network Applications
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Radiomics and Machine Learning in Medical Imaging
Health Sciences →  Medicine →  Radiology, Nuclear Medicine and Imaging
COVID-19 diagnosis using AI
Health Sciences →  Medicine →  Radiology, Nuclear Medicine and Imaging

Related Documents

JOURNAL ARTICLE

Multi-scale Neighborhood Attention Transformer on U-Net for Medical Image Segmentation

Nanxing ZhangShiqiang MaXuejian LiJiahui ZhangJijun TangFei Guo

Journal:   2022 IEEE International Conference on Bioinformatics and Biomedicine (BIBM) Year: 2022 Pages: 1381-1386
JOURNAL ARTICLE

Enhancing medical image segmentation with a multi-transformer U-Net

Yongping DanWeishou JinXuebin YueZhida Wang

Journal:   PeerJ Year: 2024 Vol: 12 Pages: e17005-e17005
JOURNAL ARTICLE

Multi-scale conv-attention U-Net for medical image segmentation

Linqiang PanChengxue ZhangJingbo SunLina Guo

Journal:   Scientific Reports Year: 2025 Vol: 15 (1)Pages: 12041-12041
© 2026 ScienceGate Book Chapters — All rights reserved.