Tracking Fast by Learning Slow: An Event-based Speed Adaptive Hand   Tracker Leveraging Knowledge in RGB Domain

Chuanlin Lan; Ziyuan Yin; Arindam Basu; Rosa H. M. Chan

arXiv:2302.14430·cs.CV·March 1, 2023·1 cites

Tracking Fast by Learning Slow: An Event-based Speed Adaptive Hand Tracker Leveraging Knowledge in RGB Domain

Chuanlin Lan, Ziyuan Yin, Arindam Basu, Rosa H. M. Chan

PDF

Open Access

TL;DR

This paper introduces an event-based speed adaptive hand tracker that leverages RGB knowledge, constructed a new dataset, and employs novel data augmentation and segmentation methods to improve fast hand tracking accuracy.

Contribution

The paper presents the first 3D hand tracking dataset from an event camera and proposes a CNN model with domain adaptation, data augmentation, and event stream segmentation for fast hand tracking.

Findings

01

Outperforms RGB-based hand tracking methods in fast motion scenarios.

02

Outperforms previous event-based solutions in speed and accuracy.

03

Demonstrates effectiveness of domain adaptation and data augmentation techniques.

Abstract

3D hand tracking methods based on monocular RGB videos are easily affected by motion blur, while event camera, a sensor with high temporal resolution and dynamic range, is naturally suitable for this task with sparse output and low power consumption. However, obtaining 3D annotations of fast-moving hands is difficult for constructing event-based hand-tracking datasets. In this paper, we provided an event-based speed adaptive hand tracker (ESAHT) to solve the hand tracking problem based on event camera. We enabled a CNN model trained on a hand tracking dataset with slow motion, which enabled the model to leverage the knowledge of RGB-based hand tracking solutions, to work on fast hand tracking tasks. To realize our solution, we constructed the first 3D hand tracking dataset captured by an event camera in a real-world environment, figured out two data augment methods to narrow the domain…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Memory and Neural Computing · Ferroelectric and Negative Capacitance Devices · EEG and Brain-Computer Interfaces

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings