Label-Efficient Few-Shot Semantic Segmentation with Unsupervised Meta-Training

Jianwu Li; Kaiyue Shi; Guo-Sen Xie; Xiaofeng Liu; Jian Zhang; Tianfei Zhou

doi:10.1609/aaai.v38i4.28094

ScienceGate Book Chapters

JOURNAL ARTICLE

Label-Efficient Few-Shot Semantic Segmentation with Unsupervised Meta-Training

Jianwu Li Kaiyue Shi Guo-Sen Xie Xiaofeng Liu Jian Zhang Tianfei Zhou

Year: 2024 Journal: Proceedings of the AAAI Conference on Artificial Intelligence Vol: 38 (4)Pages: 3109-3117 Publisher: Association for the Advancement of Artificial Intelligence

DOI: 10.1609/aaai.v38i4.28094

Get Full-Text PDF Get Analytical Report

Abstract

The goal of this paper is to alleviate the training cost for few-shot semantic segmentation (FSS) models. Despite that FSS in nature improves model generalization to new concepts using only a handful of test exemplars, it relies on strong supervision from a considerable amount of labeled training data for base classes. However, collecting pixel-level annotations is notoriously expensive and time-consuming, and small-scale training datasets convey low information density that limits test-time generalization. To resolve the issue, we take a pioneering step towards label-efficient training of FSS models from fully unlabeled training data, or additionally a few labeled samples to enhance the performance. This motivates an approach based on a novel unsupervised meta-training paradigm. In particular, the approach first distills pre-trained unsupervised pixel embedding into compact semantic clusters from which a massive number of pseudo meta-tasks is constructed. To mitigate the noise in the pseudo meta-tasks, we further advocate a robust Transformer-based FSS model with a novel prototype-based cross-attention design. Extensive experiments have been conducted on two standard benchmarks, i.e., PASCAL-5i and COCO-20i, and the results show that our method produces impressive performance without any annotations, and is comparable to fully supervised competitors even using only 20% of the annotations. Our code is available at: https://github.com/SSSKYue/UMTFSS.

Keywords:

Shot (pellet) Segmentation Artificial intelligence Computer science Training (meteorology) Natural language processing Pattern recognition (psychology) Machine learning Geography Materials science

Metrics

Cited By

2.43

FWCI (Field Weighted Citation Impact)

Refs

0.85

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Machine Learning and Data Classification

Physical Sciences → Computer Science → Artificial Intelligence

Domain Adaptation and Few-Shot Learning

Physical Sciences → Computer Science → Artificial Intelligence

Advanced Image and Video Retrieval Techniques

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Label-Efficient Few-Shot Semantic Segmentation with Unsupervised Meta-Training

Abstract

Metrics

Citation History

Topics

Related Documents

SML: Semantic meta-learning for few-shot semantic segmentation☆

Unsupervised Semantic Segmentation with Feature Enhancement for Few-shot Image Classification

Iterative Few-shot Semantic Segmentation from Image Label Text

Label Anything: Multi-Class Few-Shot Semantic Segmentation with Visual Prompts

EyeSeg: Fast and Efficient Few-Shot Semantic Segmentation