JOURNAL ARTICLE

TPAFNet: Transformer-Driven Pyramid Attention Fusion Network for 3D Medical Image Segmentation

Zheng LiJinhui ZhangSiyi WeiYueyang GaoChengwei CaoZhiwei Wu

Year: 2024 Journal:   IEEE Journal of Biomedical and Health Informatics Vol: 28 (11)Pages: 6803-6814   Publisher: Institute of Electrical and Electronics Engineers

Abstract

The field of 3D medical image segmentation is witnessing a growing trend in the utilization of combined networks that integrate convolutional neural networks and transformers. Nevertheless, prevailing hybrid networks are confronted with limitations in their straightforward serial or parallel combination methods and lack an effective mechanism to fuse channel and spatial feature attention. To address these limitations, we present a robust multi-scale 3D medical image segmentation network, the Transformer-Driven Pyramid Attention Fusion Network, which is denoted as TPAFNet, leveraging a hybrid structure of CNN and transformer. Within this framework, we exploit the characteristics of atrous convolution to extract multi-scale information effectively, thereby enhancing the encoding results of the transformer. Furthermore, we introduce the TPAF block in the encoder to seamlessly fuse channel and spatial feature attention from multi-scale feature inputs. In contrast to conventional skip connections that simply concatenate or add features, our decoder is enriched with a TPAF connection, elevating the integration of feature attention between low-level and high-level features. Additionally, we propose a low-level encoding shortcut from the original input to the decoder output, preserving more original image features and contributing to enhanced results. Finally, the deep supervision is implemented using a novel CNN-based voxel-wise classifier to facilitate better network convergence. Experimental results demonstrate that TPAFNet significantly outperforms other state-of-the-art networks on two public datasets, indicating that our research can effectively improve the accuracy of medical image segmentation, thereby assisting doctors in making more precise diagnoses.

Keywords:
Computer science Artificial intelligence Segmentation Encoder Image segmentation Convolutional neural network Pattern recognition (psychology) Feature extraction Deep learning Pyramid (geometry) Computer vision

Metrics

6
Cited By
2.21
FWCI (Field Weighted Citation Impact)
45
Refs
0.79
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Medical Imaging and Analysis
Physical Sciences →  Engineering →  Biomedical Engineering
Advanced Neural Network Applications
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Medical Image Segmentation Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition

Related Documents

JOURNAL ARTICLE

Pyramid Predictive Attention Network for Medical Image Segmentation

Tingxiao YangYuichiro YoshimuraAkira MoritaTakao NamikiToshiya Nakaguchi

Journal:   IEICE Transactions on Fundamentals of Electronics Communications and Computer Sciences Year: 2019 Vol: E102.A (9)Pages: 1225-1234
JOURNAL ARTICLE

A multiscale residual pyramid attention network for medical image fusion

Jun FuWeisheng LiJiao DuYuping Huang

Journal:   Biomedical Signal Processing and Control Year: 2021 Vol: 66 Pages: 102488-102488
JOURNAL ARTICLE

Multi-scale feature pyramid fusion network for medical image segmentation

Bing ZhangYan WangCaifu DingZiqing DengLinwei LiZesheng QinZhao DingLifeng BianChen Yang

Journal:   International Journal of Computer Assisted Radiology and Surgery Year: 2022 Vol: 18 (2)Pages: 353-365
© 2026 ScienceGate Book Chapters — All rights reserved.