BOOK-CHAPTER

Label Anything: Multi-Class Few-Shot Semantic Segmentation with Visual Prompts

Abstract

Few-shot semantic segmentation aims to segment objects from previously unseen classes using only a limited number of labeled examples. In this paper, we introduce Label Anything, a novel transformer-based architecture designed for multi-prompt, multi-way few-shot semantic segmentation. Our approach leverages diverse visual prompts—points, bounding boxes, and masks—to create a highly flexible and generalizable framework that significantly reduces annotation burden while maintaining high accuracy. Label Anything makes three key contributions: (i) we introduce a new task formulation that relaxes conventional few-shot segmentation constraints by supporting various types of prompts, multi-class classification, and enabling multiple prompts within a single image; (ii) we propose a novel architecture based on transformers and attention mechanisms; and (iii) we design a versatile training procedure allowing our model to operate seamlessly across different N-way K-shot and prompt-type configurations with a single trained model. Our extensive experimental evaluation on the widely used COCO-20i benchmark demonstrates that Label Anything achieves state-of-the-art performance among existing multi-way few-shot segmentation methods, while significantly outperforming leading single-class models when evaluated in multi-class settings. Code and trained models are available at https://github.com/pasqualedem/LabelAnything.

Keywords:

Metrics

2
Cited By
22.96
FWCI (Field Weighted Citation Impact)
0
Refs
0.99
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Multimodal Machine Learning Applications
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Topic Modeling
Physical Sciences →  Computer Science →  Artificial Intelligence
Natural Language Processing Techniques
Physical Sciences →  Computer Science →  Artificial Intelligence

Related Documents

JOURNAL ARTICLE

Prototype Queue Learning for Multi-Class Few-Shot Semantic Segmentation

Zichao WangZhiyu JiangYuan Yuan

Journal:   2022 IEEE International Conference on Image Processing (ICIP) Year: 2022 Pages: 1721-1725
JOURNAL ARTICLE

Label-Efficient Few-Shot Semantic Segmentation with Unsupervised Meta-Training

Jianwu LiKaiyue ShiGuo-Sen XieXiaofeng LiuJian ZhangTianfei Zhou

Journal:   Proceedings of the AAAI Conference on Artificial Intelligence Year: 2024 Vol: 38 (4)Pages: 3109-3117
JOURNAL ARTICLE

Iterative Few-shot Semantic Segmentation from Image Label Text

Haohan WangLiang LiuWuhao ZhangJiangning ZhangZhenye GanYabiao WangShuo WangHaoqian Wang

Journal:   Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence Year: 2022 Pages: 1385-1392
© 2026 ScienceGate Book Chapters — All rights reserved.