Xu Ding, Shun Yu, Jingxuan Zhou, Fusen Guo, Lin Li, Jishizhan Chen
Few-shot semantic segmentation aims to accurately segment objects from a limited amount of annotated data, a task complicated by intra-class variation and by the difficulty of representing objects with prototypes. To address these issues, we propose the Multi-Scale Prototype Convolutional Network (MPCN). Our approach introduces a Prior Mask Generation (PMG) module, which employs dynamic kernels of varying sizes to capture multi-scale object features. This enhances the interaction between support and query features, thereby improving segmentation accuracy. Additionally, we present a Multi-Scale Prototype Extraction (MPE) module to overcome the limitations of masked average pooling (MAP). By augmenting support-set features, assessing spatial importance, and applying multi-scale downsampling, we obtain a more accurate set of prototypes. Extensive experiments on the PASCAL-5i and COCO-20i datasets demonstrate that our method achieves superior performance in both 1-shot and 5-shot settings.
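To make the prototype-extraction idea concrete, the following is a minimal NumPy sketch of the MAP baseline the abstract refers to, plus a hypothetical multi-scale variant that pools at several downsampled resolutions. All function names, the choice of scales, and the simple block-average downsampling are illustrative assumptions; the actual MPE module additionally augments support features and reweights spatial importance, which is omitted here.

```python
import numpy as np

def masked_average_pooling(feat, mask):
    """Baseline MAP prototype: mean of support features inside the mask.

    feat: (C, H, W) support feature map; mask: (H, W) binary foreground mask.
    Returns a single (C,) prototype vector.
    """
    m = mask[None, :, :]                     # broadcast mask over channels
    denom = m.sum() + 1e-8                   # avoid division by zero
    return (feat * m).sum(axis=(1, 2)) / denom

def multiscale_prototypes(feat, mask, scales=(1, 2, 4)):
    """Illustrative multi-scale extraction (an assumption, not the paper's
    exact MPE module): pool feature map and mask at several downsampled
    resolutions so the prototype set captures coarse and fine structure.
    Returns an array of shape (len(scales), C).
    """
    protos = []
    C, H, W = feat.shape
    for s in scales:
        h, w = H - H % s, W - W % s          # crop so the map tiles evenly
        # block-average downsampling by a factor of s
        f = feat[:, :h, :w].reshape(C, h // s, s, w // s, s).mean(axis=(2, 4))
        m = mask[:h, :w].reshape(h // s, s, w // s, s).mean(axis=(1, 3))
        protos.append(masked_average_pooling(f, (m > 0.5).astype(feat.dtype)))
    return np.stack(protos)
```

Each row of the returned array is one prototype; at coarser scales a single vector summarizes larger object regions, which is the intuition behind using a prototype *set* rather than the single MAP vector.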