JOURNAL ARTICLE

META-Unet: Multi-Scale Efficient Transformer Attention Unet for Fast and High-Accuracy Polyp Segmentation

Huisi WuZebin ZhaoZhaoze Wang

Year: 2023 Journal:   IEEE Transactions on Automation Science and Engineering Vol: 21 (3)Pages: 4117-4128   Publisher: Institute of Electrical and Electronics Engineers

Abstract

Polyp segmentation plays an important role in preventing Colorectal cancer. Although Vision Transformer has been widely introduced in medical image segmentation to compensate the limitations of traditional CNN in modeling global context, its shortcomings in learning the fine-detailed features and the heavy computation cost also hinder its application in challenging polyp segmentation due to the various shapes and sizes of polyps, the low-intensity contrast between polyps and surrounding tissues, and the inherent real-time requirement. In this paper, we propose a multi-scale efficient transformer attention (META) mechanism for fast and high-accuracy polyp segmentation, where efficient transformer blocks are employed to generate multi-scale element-wise attentions for adaptive feature fusion in the famous U-shape encoder-decoder architecture. Specifically, our META mechanism includes two branches to capture multi-scale long-term dependencies, which are implemented via two efficient transformer blocks with different resolutions. The local branch is used to capture a relatively smaller transform attention under a relatively lower resolution, while the global branch is used to capture high-resolution transform attention. The final poly segmentation results are progressively integrated based on the META mechanism in each layer of the decoder. Extensive experiments are conducted on four polyp segmentation datasets (CVC-ClinicDB, Endoscenestill, Kvasir-SEG and ETIS-Larib) to demonstrate its advantages, consistently outperforming different competitors. While using ResNet34 as backbones, it can achieve 85.78% IoU and 92.03% Dice, 88.99% IoU and 93.85% Dice, 86.42% IoU and 91.86% Dice respectively in CVC-ClinicDB, Endoscenestill, and Kvasir-SEG, and a speed of 98 FPS at the input size of $3 \times 512 \times 512$ on a NVIDIA GeForce RTX 3090 card. The code is available at https://github.com/szuzzb/META-Unet. Note to Practitioners —Automatic polyp segmentation is a crucial step of polyp recognition and diagnostic of colonoscopy, which usually require both high-accuracy and real-time performance. This article proposes a novel polyp segmentation method, namely META-Unet, by modeling multi-scale attention maps effectively and efficiently based on a novel multi-scale efficient transformer attention (META) mechanism, for faster and higher-accuracy polyp segmentation. We evaluate our META-Unet on four public polyp image segmentation datasets (CVC-ClinicDB, Endoscenestill, Kvasir-SEG and ETIS-Larib). Comprehensive experimental results validate its outstanding performance with a better balance in both accuracy and inference speed. The proposed META mechanism is potentially to be embedded in various deep learning frameworks and facilitates more computer-aided applications in clinical practice.

Keywords:
Segmentation Computer science Artificial intelligence Encoder Image segmentation Dice Pattern recognition (psychology) Transformer Computer vision Engineering Voltage Mathematics

Metrics

85
Cited By
26.27
FWCI (Field Weighted Citation Impact)
58
Refs
1.00
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Radiomics and Machine Learning in Medical Imaging
Health Sciences →  Medicine →  Radiology, Nuclear Medicine and Imaging
Advanced Neural Network Applications
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Colorectal Cancer Screening and Detection
Health Sciences →  Medicine →  Oncology

Related Documents

JOURNAL ARTICLE

Multi‐scale nested UNet with transformer for colorectal polyp segmentation

Zenan WangZhen LiuJianfeng YuYingxin GaoMing Liu

Journal:   Journal of Applied Clinical Medical Physics Year: 2024 Vol: 25 (6)Pages: e14351-e14351
JOURNAL ARTICLE

Synergistic Multi-Granularity Rough Attention UNet for Polyp Segmentation

Jing WangChia S. Lim

Journal:   Journal of Imaging Year: 2025 Vol: 11 (4)Pages: 92-92
JOURNAL ARTICLE

AFC-Unet: Attention-fused full-scale CNN-transformer unet for medical image segmentation

W.J. MengShujun LiuHuajun Wang

Journal:   Biomedical Signal Processing and Control Year: 2024 Vol: 99 Pages: 106839-106839
JOURNAL ARTICLE

ACU-TransNet: Attention and convolution-augmented UNet-transformer network for polyp segmentation

Lei HuangYun Wu

Journal:   Journal of X-Ray Science and Technology Year: 2024 Vol: 32 (6)Pages: 1-16
© 2026 ScienceGate Book Chapters — All rights reserved.