Egocentric Human-Object Interaction Detection Exploiting Synthetic Data
Rosario Leonardi, Francesco Ragusa, Antonino Furnari, and Giovanni, Maria Farinella

TL;DR
This paper introduces a synthetic data generation pipeline for detecting egocentric human-object interactions in industrial settings, demonstrating improved detection performance with limited real data.
Contribution
The authors propose a novel synthetic data generation tool and pipeline for EHOI detection, along with a method that leverages this data to improve detection accuracy.
Findings
Synthetic data enhances EHOI detection performance.
Method performs well with limited real data.
Public dataset release supports further research.
Abstract
We consider the problem of detecting Egocentric HumanObject Interactions (EHOIs) in industrial contexts. Since collecting and labeling large amounts of real images is challenging, we propose a pipeline and a tool to generate photo-realistic synthetic First Person Vision (FPV) images automatically labeled for EHOI detection in a specific industrial scenario. To tackle the problem of EHOI detection, we propose a method that detects the hands, the objects in the scene, and determines which objects are currently involved in an interaction. We compare the performance of our method with a set of state-of-the-art baselines. Results show that using a synthetic dataset improves the performance of an EHOI detection system, especially when few real data are available. To encourage research on this topic, we publicly release the proposed dataset at the following url:…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Multimodal Machine Learning Applications · Advanced Neural Network Applications
