EgoAdapt: A multi-stream evaluation study of adaptation to real-world egocentric user video
Matthias De Lange, Hamid Eghbalzadeh, Reuben Tan, Michael Iuzzolino,, Franziska Meier, Karl Ridgeway

TL;DR
EgoAdapt introduces a real-world benchmark and evaluation framework for adaptive egocentric action recognition, enabling models to personalize and improve performance through online adaptation on user data streams.
Contribution
The paper presents EgoAdapt, a new benchmark and evaluation method for on-device, online adaptation in egocentric video recognition, addressing real-world challenges like distribution shifts and long-tailed actions.
Findings
Finetuning improves adaptation performance.
Experience replay enhances online generalization.
Meta-evaluation provides robust assessment across user streams.
Abstract
In egocentric action recognition a single population model is typically trained and subsequently embodied on a head-mounted device, such as an augmented reality headset. While this model remains static for new users and environments, we introduce an adaptive paradigm of two phases, where after pretraining a population model, the model adapts on-device and online to the user's experience. This setting is highly challenging due to the change from population to user domain and the distribution shifts in the user's data stream. Coping with the latter in-stream distribution shifts is the focus of continual learning, where progress has been rooted in controlled benchmarks but challenges faced in real-world applications often remain unaddressed. We introduce EgoAdapt, a benchmark for real-world egocentric action recognition that facilitates our two-phased adaptive paradigm, and real-world…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Context-Aware Activity Recognition Systems
MethodsFocus
