Learning Human-Object Interaction Detection using Interaction Points
Tiancai Wang, Tong Yang, Martin Danelljan, Fahad Shahbaz Khan, and Xiangyu Zhang, Jian Sun

TL;DR
This paper introduces a novel fully-convolutional method for human-object interaction detection that predicts interaction points and vectors, achieving state-of-the-art results on V-COCO and HICO-DET benchmarks.
Contribution
It proposes the first approach to treat HOI detection as a keypoint detection and grouping problem, improving accuracy over previous instance-centric methods.
Findings
Sets new state-of-the-art on V-COCO and HICO-DET datasets.
Introduces a fully-convolutional network predicting interaction points and vectors.
Demonstrates the effectiveness of keypoint-based HOI detection.
Abstract
Understanding interactions between humans and objects is one of the fundamental problems in visual classification and an essential step towards detailed scene understanding. Human-object interaction (HOI) detection strives to localize both the human and an object as well as the identification of complex interactions between them. Most existing HOI detection approaches are instance-centric where interactions between all possible human-object pairs are predicted based on appearance features and coarse spatial information. We argue that appearance features alone are insufficient to capture complex human-object interactions. In this paper, we therefore propose a novel fully-convolutional approach that directly detects the interactions between human-object pairs. Our network predicts interaction points, which directly localize and classify the inter-action. Paired with the densely predicted…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Learning Human-Object Interaction Detection Using Interaction Points· youtube
Taxonomy
TopicsMultimodal Machine Learning Applications · Advanced Neural Network Applications · Human Pose and Action Recognition
