TL;DR
EgoKit provides a unified, low-cost solution for egocentric data collection across diverse devices, enabling synchronized video and sensor data with minimal hardware customization.
Contribution
It introduces a toolkit that standardizes egocentric recording workflows across six heterogeneous devices, including synchronized video and sensor data logging.
Findings
Supports six different device types with a unified interface.
Provides synchronized ego-view and wrist-view videos with a common format.
Includes accessories for wrist-view capture without custom hardware.
Abstract
Egocentric video is increasingly used as a data source for robot learning, activity understanding, and embodied AI research, but collecting it at scale remains fragmented in practice: each candidate host device, such as an Android phone, iPhone, iPad, smart glasses, or extended reality (XR) headset, exposes a different SDK, a different policy on raw camera access, and different limitations on external USB cameras and on-device tracking. Synchronized ego-view and wrist-view capture is therefore typically obtained by either committing to a single proprietary platform or building one-off rigs that do not transfer across devices. To address this gap, we present EgoKit, a toolkit that exposes the same egocentric recording workflow across six heterogeneous host devices. Across all supported devices, EgoKit presents the same recording interaction and produces locally stored video with a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
