Object Aware Egocentric Online Action Detection
Joungbin An, Yunsu Park, Hyolim Kang, Seon Joo Kim

TL;DR
This paper introduces an Object-Aware Module for egocentric online action detection, leveraging object-specific priors and temporal dynamics to improve scene understanding and action detection in first-person videos.
Contribution
It presents a novel Object-Aware Module tailored for egocentric videos, enhancing existing online action detection frameworks with minimal overhead.
Findings
Improved action detection accuracy on Epic-Kitchens 100 dataset.
Seamless integration with existing models with minimal computational overhead.
Consistent performance improvements across various scenarios.
Abstract
Advancements in egocentric video datasets like Ego4D, EPIC-Kitchens, and Ego-Exo4D have enriched the study of first-person human interactions, which is crucial for applications in augmented reality and assisted living. Despite these advancements, current Online Action Detection methods, which efficiently detect actions in streaming videos, are predominantly designed for exocentric views and thus fail to capitalize on the unique perspectives inherent to egocentric videos. To address this gap, we introduce an Object-Aware Module that integrates egocentric-specific priors into existing OAD frameworks, enhancing first-person footage interpretation. Utilizing object-specific details and temporal dynamics, our module improves scene understanding in detecting actions. Validated extensively on the Epic-Kitchens 100 dataset, our work can be seamlessly integrated into existing models with minimal…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Video Analysis and Summarization · Anomaly Detection Techniques and Applications
