Exploring Temporally-Aware Features for Point Tracking
In\`es Hyeonsu Kim, Seokju Cho, Jiahui Huang, Jung Yi, Joon-Young Lee,, Seungryong Kim

TL;DR
This paper introduces Chrono, a temporally-aware feature backbone for point tracking that leverages pre-trained representations and a temporal adapter to achieve state-of-the-art results without refinement, improving efficiency and robustness.
Contribution
The paper presents Chrono, a novel backbone with built-in temporal awareness for point tracking, reducing reliance on multi-stage refinement and enhancing performance with pre-trained features.
Findings
Chrono achieves state-of-the-art results on TAP-Vid-DAVIS and TAP-Vid-Kinetics datasets.
Chrono outperforms common feature backbones in point tracking tasks.
Chrono operates efficiently without the need for a refinement stage.
Abstract
Point tracking in videos is a fundamental task with applications in robotics, video editing, and more. While many vision tasks benefit from pre-trained feature backbones to improve generalizability, point tracking has primarily relied on simpler backbones trained from scratch on synthetic data, which may limit robustness in real-world scenarios. Additionally, point tracking requires temporal awareness to ensure coherence across frames, but using temporally-aware features is still underexplored. Most current methods often employ a two-stage process: an initial coarse prediction followed by a refinement stage to inject temporal information and correct errors from the coarse stage. These approach, however, is computationally expensive and potentially redundant if the feature backbone itself captures sufficient temporal information. In this work, we introduce Chrono, a feature backbone…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Motion and Animation · Video Analysis and Summarization · Data Management and Algorithms
