Decoupled Prototype Matching with Vision Foundation Models for Few-Shot Industrial Object Detection
Hari Prasanth S. M., Nilusha Jayawickrama, Risto Ojala

TL;DR
This paper introduces a few-shot industrial object detection framework using vision foundation models that constructs class prototypes from limited samples and matches query features for recognition, enabling efficient onboarding of new objects.
Contribution
The proposed method leverages vision foundation models for prototype-based few-shot detection, eliminating the need for large datasets or CAD models, and demonstrates competitive performance on industrial datasets.
Findings
Improves AP by 6.9% over state-of-the-art training-free methods.
Capable of onboarding new objects with only a few reference images.
Achieves competitive detection performance on industrial datasets.
Abstract
Industrial object detection systems typically rely on large annotated datasets, which are expensive to collect and challenging to maintain in industrial scenarios where the inventory of objects changes frequently. This work addresses the challenge of few-shot object detection in such industrial scenarios, where only a limited number of labeled samples are available for newly introduced objects. We present a detection framework that leverages vision foundation models to recognize objects with minimal supervision. The method constructs class prototypes from a small set of reference samples by extracting feature representations. For a given query scene during inference, object regions are generated using a segmentation model, and feature embeddings are extracted and matched with class prototypes using similarity matching. We evaluate the detection method on three established industrial…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
