Medical image segmentation is a crucial ongoing issue in clinical applications for differentiating lesions and segmenting various organs to extract relevant features. Many recent studies have combined transformers, which enable global context modeling by leveraging self-attention, with U-Nets to distinguish organs in complex volumetric medical images such as 3-dimensional (3D) Computed Tomography (CT) and Magnetic Resonance Imaging (MRI) images. In this study, we propose Dual Stream fusion U-NEt TRansformers (DS-UNETR), comprising a Dual Stream Attention Encoder (DS-AE) and a Bidirectional All Scale Fusion (Bi-ASF) module. We designed the DS-AE to extract both spatial and channel features in parallel streams to better capture the relations between channels. When transferring the extracted features from the DS-AE to the decoder, we use the Bi-ASF module to fuse features at all scales. We achieved an average Dice similarity coefficient (Dice score) improvement of 0.97% and a 7.43% improvement in the 95% Hausdorff distance (HD95) compared to a state-of-the-art model on the Synapse dataset. We also demonstrated the efficiency of our model, reducing space and time complexity with a decrease of 80.73% in parameters and 78.86% in FLoating point OPerationS (FLOPS). Our proposed model, DS-UNETR, shows superior performance and efficiency in terms of segmentation accuracy and model complexity (both space and time) compared to existing state-of-the-art models on a 3D medical image segmentation benchmark dataset. The approach of our proposed model can be effectively applied in various medical big data analysis applications.
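The dual-stream idea of attending over spatial positions and over channels in parallel, then fusing the two streams, can be illustrated with a minimal sketch. This is a hypothetical NumPy toy, not the paper's actual DS-AE: `dual_stream_block` and the summation fusion are illustrative assumptions only.

```python
import numpy as np

def softmax(x, axis=-1):
    # numerically stable softmax
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def spatial_attention(x):
    # x: (N, C) tokens; attention weights over the N spatial tokens
    scores = x @ x.T / np.sqrt(x.shape[1])   # (N, N)
    return softmax(scores, axis=-1) @ x      # (N, C)

def channel_attention(x):
    # attention weights over the C channels instead of spatial positions,
    # modeling relations between channels
    scores = x.T @ x / np.sqrt(x.shape[0])   # (C, C)
    return x @ softmax(scores, axis=-1)      # (N, C)

def dual_stream_block(x):
    # run both streams in parallel and fuse by summation (assumed fusion)
    return spatial_attention(x) + channel_attention(x)

x = np.random.default_rng(0).standard_normal((16, 8))  # 16 tokens, 8 channels
y = dual_stream_block(x)
print(y.shape)  # (16, 8)
```

Both streams preserve the token/channel shape, so their outputs can be fused elementwise before being passed on; a real encoder would use learned query/key/value projections rather than the raw features.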