Label Anything: Multi-Class Few-Shot Semantic Segmentation with Visual Prompts
Pasquale De Marinis, Nicola Fanelli, Raffaele Scaringi, Emanuele Colonna, Giuseppe Fiameni, Gennaro Vessio, Giovanna Castellano

TL;DR
Label Anything introduces a transformer-based framework for multi-class, multi-prompt few-shot semantic segmentation, enabling flexible, prompt-based object segmentation with high accuracy and reduced annotation effort.
Contribution
It proposes a new task formulation, a transformer-based architecture, and a versatile training method for multi-class, multi-prompt few-shot segmentation.
Findings
Achieves state-of-the-art results on COCO-20i benchmark.
Outperforms single-class models in multi-class settings.
Supports various prompt types within a single model.
Abstract
Few-shot semantic segmentation aims to segment objects from previously unseen classes using only a limited number of labeled examples. In this paper, we introduce Label Anything, a novel transformer-based architecture designed for multi-prompt, multi-way few-shot semantic segmentation. Our approach leverages diverse visual prompts -- points, bounding boxes, and masks -- to create a highly flexible and generalizable framework that significantly reduces annotation burden while maintaining high accuracy. Label Anything makes three key contributions: () we introduce a new task formulation that relaxes conventional few-shot segmentation constraints by supporting various types of prompts, multi-class classification, and enabling multiple prompts within a single image; () we propose a novel architecture based on transformers and attention mechanisms; and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗pasqualedem/label_anything_sam_1024_cocomodel· 9 dl9 dl
- 🤗pasqualedem/label_anything_256_sam_1024_cocomodel
- 🤗pasqualedem/label_anything_256_oneway_sam_1024_cocomodel· 1 dl1 dl
- 🤗pasqualedem/label_anything_coco_fold0_mae_7a5p0t63model· 8 dl8 dl
- 🤗pasqualedem/label_anything_mae_480_cocomodel· 1 dl1 dl
- 🤗pasqualedem/label_anything_coco_fold1_mae_coh54ws0model
- 🤗pasqualedem/label_anything_coco_fold2_mae_2pnppb7kmodel
- 🤗pasqualedem/label_anything_coco_fold3_mae_a2gk7tetmodel
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Topic Modeling · Natural Language Processing Techniques
MethodsSparse Evolutionary Training
