Medical Image Segmentation with Dual-Encoding and Multi-Level Feature Adaptive Fusion

Shulei Wu; You Yang; Fanghong Zhang

doi:10.1142/s0218001424540041

ScienceGate Book Chapters

JOURNAL ARTICLE

Medical Image Segmentation with Dual-Encoding and Multi-Level Feature Adaptive Fusion

Shulei Wu You Yang Fanghong Zhang

Year: 2024 Journal: International Journal of Pattern Recognition and Artificial Intelligence Vol: 38 (04) Publisher: World Scientific

DOI: 10.1142/s0218001424540041

Get Full-Text PDF Get Analytical Report

Abstract

Purpose: Accurate segmentation of medical images is critical for disease diagnosis, surgical planning and prognostic assessment. TransUNet, a hybrid CNN-Transformer-based method, extracts local features using CNN and compensates for the lack of long-range dependencies through a self-attention mechanism. However, the initial focus on extracting local features from specific regions impacts the generation of subsequent global features, thus constraining the model’s capacity to effectively capture a broader range of semantic information. Effective integration of local and global features plays a pivotal role in achieving precise and dense prediction. Therefore, we propose a novel hybrid CNN-Transformer-based method aimed at enhancing medical image segmentation. Approach: In this study, a dual-encoder parallel structure is used to enhance the feature representation of the input image. By introducing a multi-scale adaptive feature fusion module, a fine fusion of local features across perceptual domains is realized in the decoding process. The generalized convolutional block attention module helps to increase cross-channel interactions in layers with more channels, thus enabling the fusion of local features and global representations at different resolutions during the decoding process. Results: The proposed method achieves average DSC scores of 79.98%, 84.83% and 85.78% on the Synapse, ISIC2017 and Pediatric Pyelonephritis datasets, respectively. These scores are 2.5%, 0.56% and 0.42% higher than those of TransUNet. The best performance of 91.66% is observed on the ACDC dataset, representing improvements of 2.46% and 7.24% compared to HiFormer and DAE-Former, respectively. Conclusions: The experimental results show that the proposed model has a significant competitive advantage in terms of ACDC image segmentation performance.

Keywords:

Artificial intelligence Computer science Encoding (memory) Feature (linguistics) Computer vision Pattern recognition (psychology) Segmentation Image (mathematics) Image segmentation Dual (grammatical number)

Metrics

Cited By

0.00

FWCI (Field Weighted Citation Impact)

Refs

0.03

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Topics

Medical Image Segmentation Techniques

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Advanced Neural Network Applications

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Advanced Image Fusion Techniques

Physical Sciences → Engineering → Media Technology

Medical Image Segmentation with Dual-Encoding and Multi-Level Feature Adaptive Fusion

Abstract

Metrics

Topics

Related Documents

DAMAF: dual attention network with multi-level adaptive complementary fusion for medical image segmentation

AMFF-NET: Adaptive Multi-Layer Feature Fusion Network for Medical Image Segmentation

LAMFFNet: Lightweight Adaptive Multi-layer Feature Fusion network for medical image segmentation

DEFIF-Net: A lightweight dual-encoding feature interaction fusion network for medical image segmentation

DMD: Dual attention fusion and multi-scale feature fusion decoding for medical image segmentation