TL;DR
This paper introduces a novel asynchronous feature tracking method that combines event cameras and standard frames, leveraging their complementary strengths to achieve high-accuracy, low-latency visual tracking.
Contribution
It presents the first principled, generative model-based approach for directly using raw intensity measurements from combined event and frame data.
Findings
More accurate feature tracks than previous methods
Longer feature tracks across diverse scenes
Subpixel accuracy in feature tracking
Abstract
We present a method that leverages the complementarity of event cameras and standard cameras to track visual features with low-latency. Event cameras are novel sensors that output pixel-level brightness changes, called "events". They offer significant advantages over standard cameras, namely a very high dynamic range, no motion blur, and a latency in the order of microseconds. However, because the same scene pattern can produce different events depending on the motion direction, establishing event correspondences across time is challenging. By contrast, standard cameras provide intensity measurements (frames) that do not depend on motion direction. Our method extracts features on frames and subsequently tracks them asynchronously using events, thereby exploiting the best of both types of data: the frames provide a photometric representation that does not depend on motion direction and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
