Detecting Human-to-Human-or-Object (H2O) Interactions with DIABOLO
Astrid Orcesi, Romaric Audigier, Fritz Poka Toukam, Bertrand, Luvison

TL;DR
This paper introduces a new dataset and a single-shot detection method for comprehensive human interaction analysis, including human-to-human and human-to-object interactions, improving efficiency and performance.
Contribution
It presents a novel H2O interaction dataset with a new verb taxonomy and a single-shot detection model, DIABOLO, that efficiently detects all interactions with constant inference time.
Findings
DIABOLO outperforms state-of-the-art methods on V-COCO dataset.
The new dataset includes more interactions and a refined verb taxonomy.
Sharing detection tasks improves performance and reduces computation.
Abstract
Detecting human interactions is crucial for human behavior analysis. Many methods have been proposed to deal with Human-to-Object Interaction (HOI) detection, i.e., detecting in an image which person and object interact together and classifying the type of interaction. However, Human-to-Human Interactions, such as social and violent interactions, are generally not considered in available HOI training datasets. As we think these types of interactions cannot be ignored and decorrelated from HOI when analyzing human behavior, we propose a new interaction dataset to deal with both types of human interactions: Human-to-Human-or-Object (H2O). In addition, we introduce a novel taxonomy of verbs, intended to be closer to a description of human body attitude in relation to the surrounding targets of interaction, and more independent of the environment. Unlike some existing datasets, we strive to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
