Matching Anything by Segmenting Anything
Siyuan Li, Lei Ke, Martin Danelljan, Luigi Piccinelli, Mattia Segu,, Luc Van Gool, Fisher Yu

TL;DR
MASA leverages the Segment Anything Model to enable zero-shot object tracking across diverse domains without labeled video data, outperforming existing methods on multiple benchmarks.
Contribution
Introduces MASA, a novel zero-shot instance association method using SAM for robust cross-domain object matching without tracking labels.
Findings
Achieves superior zero-shot tracking performance on MOT benchmarks.
Uses only unlabeled static images for training, reducing data annotation needs.
Demonstrates strong generalization across complex video domains.
Abstract
The robust association of the same objects across video frames in complex scenes is crucial for many applications, especially Multiple Object Tracking (MOT). Current methods predominantly rely on labeled domain-specific video datasets, which limits the cross-domain generalization of learned similarity embeddings. We propose MASA, a novel method for robust instance association learning, capable of matching any objects within videos across diverse domains without tracking labels. Leveraging the rich object segmentation from the Segment Anything Model (SAM), MASA learns instance-level correspondence through exhaustive data transformations. We treat the SAM outputs as dense object region proposals and learn to match those regions from a vast image collection. We further design a universal MASA adapter which can work in tandem with foundational segmentation or detection models and enable…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAI-based Problem Solving and Planning · Machine Learning and Data Classification · Context-Aware Activity Recognition Systems
MethodsAdapter · Segment Anything Model
