Match4Annotate: Propagating Sparse Video Annotations via Implicit Neural Feature Matching
Zhuorui Zhang, Roger Pallar\`es-L\'opez, Praneeth Namburi, Brian W. Anthony

TL;DR
Match4Annotate introduces a novel neural feature matching framework that efficiently propagates annotations across video frames and videos, significantly improving scalability and accuracy in medical imaging annotation tasks.
Contribution
The paper presents a lightweight, test-time optimized neural feature matching method that unifies intra- and inter-video annotation propagation, overcoming limitations of existing approaches.
Findings
Achieves state-of-the-art inter-video propagation performance.
Outperforms traditional feature matching and segmentation baselines.
Remains competitive with specialized trackers for intra-video tasks.
Abstract
Acquiring per-frame video annotations remains a primary bottleneck for deploying computer vision in specialized domains such as medical imaging, where expert labeling is slow and costly. Label propagation offers a natural solution, yet existing approaches face fundamental limitations. Video trackers and segmentation models can propagate labels within a single sequence but require per-video initialization and cannot generalize across videos. Classic correspondence pipelines operate on detector-chosen keypoints and struggle in low-texture scenes, while dense feature matching and one-shot segmentation methods enable cross-video propagation but lack spatiotemporal smoothness and unified support for both point and mask annotations. We present Match4Annotate, a lightweight framework for both intra-video and inter-video propagation of point and mask annotations. Our method fits a SIREN-based…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Advanced Neural Network Applications · Advanced Image and Video Retrieval Techniques
