Egocentric Human-Object Interaction Detection: A New Benchmark and Method
Kunyuan Deng, Yi Wang, and Lap-Pui Chau

TL;DR
This paper introduces a new egocentric human-object interaction detection benchmark dataset and a novel method leveraging hand geometry to improve detection accuracy under occlusion.
Contribution
The paper presents Ego-HOIBench, a comprehensive egocentric HOI dataset, and proposes HGIR, a novel approach utilizing hand pose and geometry for better interaction detection.
Findings
Ego-HOIBench contains over 27K images with detailed annotations.
Existing detectors perform poorly on egocentric data, highlighting the need for specialized methods.
HGIR achieves state-of-the-art results on Ego-HOIBench.
Abstract
Egocentric human-object interaction (Ego-HOI) detection is crucial for intelligent agents to understand and assist human activities from a first-person perspective. However, progress has been hindered by the lack of benchmarks and methods tailored to egocentric challenges such as severe hand-object occlusion. In this paper, we introduce the real-world Ego-HOI detection task and the accompanying Ego-HOIBench, a new dataset with over 27K egocentric images and explicit, fine-grained hand-verb-object triplet annotations across 123 categories. Ego-HOIBench covers diverse daily scenarios, object types, and both single- and two-hand interactions, offering a comprehensive testbed for Ego-HOI research. Benchmarking existing third-person HOI detectors on Ego-HOIBench reveals significant performance gaps, highlighting the need for egocentric-specific solutions. To this end, we propose Hand…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Multimodal Machine Learning Applications · Anomaly Detection Techniques and Applications
MethodsFocus
