Match4Annotate: Propagating Sparse Video Annotations via Implicit Neural Feature Matching

Zhuorui Zhang; Roger Pallar\`es-L\'opez; Praneeth Namburi; Brian W. Anthony

arXiv:2603.06471·cs.CV·March 18, 2026

Match4Annotate: Propagating Sparse Video Annotations via Implicit Neural Feature Matching

Zhuorui Zhang, Roger Pallar\`es-L\'opez, Praneeth Namburi, Brian W. Anthony

PDF

Open Access

TL;DR

Match4Annotate introduces a novel neural feature matching framework that efficiently propagates annotations across video frames and videos, significantly improving scalability and accuracy in medical imaging annotation tasks.

Contribution

The paper presents a lightweight, test-time optimized neural feature matching method that unifies intra- and inter-video annotation propagation, overcoming limitations of existing approaches.

Findings

01

Achieves state-of-the-art inter-video propagation performance.

02

Outperforms traditional feature matching and segmentation baselines.

03

Remains competitive with specialized trackers for intra-video tasks.

Abstract

Acquiring per-frame video annotations remains a primary bottleneck for deploying computer vision in specialized domains such as medical imaging, where expert labeling is slow and costly. Label propagation offers a natural solution, yet existing approaches face fundamental limitations. Video trackers and segmentation models can propagate labels within a single sequence but require per-video initialization and cannot generalize across videos. Classic correspondence pipelines operate on detector-chosen keypoints and struggle in low-texture scenes, while dense feature matching and one-shot segmentation methods enable cross-video propagation but lack spatiotemporal smoothness and unified support for both point and mask annotations. We present Match4Annotate, a lightweight framework for both intra-video and inter-video propagation of point and mask annotations. Our method fits a SIREN-based…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMultimodal Machine Learning Applications · Advanced Neural Network Applications · Advanced Image and Video Retrieval Techniques