Tracking Fast by Learning Slow: An Event-based Speed Adaptive Hand Tracker Leveraging Knowledge in RGB Domain
Chuanlin Lan, Ziyuan Yin, Arindam Basu, Rosa H. M. Chan

TL;DR
This paper introduces an event-based speed adaptive hand tracker that leverages RGB knowledge, constructed a new dataset, and employs novel data augmentation and segmentation methods to improve fast hand tracking accuracy.
Contribution
The paper presents the first 3D hand tracking dataset from an event camera and proposes a CNN model with domain adaptation, data augmentation, and event stream segmentation for fast hand tracking.
Findings
Outperforms RGB-based hand tracking methods in fast motion scenarios.
Outperforms previous event-based solutions in speed and accuracy.
Demonstrates effectiveness of domain adaptation and data augmentation techniques.
Abstract
3D hand tracking methods based on monocular RGB videos are easily affected by motion blur, while event camera, a sensor with high temporal resolution and dynamic range, is naturally suitable for this task with sparse output and low power consumption. However, obtaining 3D annotations of fast-moving hands is difficult for constructing event-based hand-tracking datasets. In this paper, we provided an event-based speed adaptive hand tracker (ESAHT) to solve the hand tracking problem based on event camera. We enabled a CNN model trained on a hand tracking dataset with slow motion, which enabled the model to leverage the knowledge of RGB-based hand tracking solutions, to work on fast hand tracking tasks. To realize our solution, we constructed the first 3D hand tracking dataset captured by an event camera in a real-world environment, figured out two data augment methods to narrow the domain…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Memory and Neural Computing · Ferroelectric and Negative Capacitance Devices · EEG and Brain-Computer Interfaces
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
