Distance Matters in Human-Object Interaction Detection
Guangzhi Wang, Yangyang Guo, Yongkang Wong, Mohan Kankanhalli

TL;DR
This paper introduces a novel two-stage approach with a distance-aware attention module and loss function to improve human-object interaction detection, especially for distant interactions, achieving state-of-the-art results.
Contribution
It proposes a new distance-aware attention mechanism and loss function specifically designed to enhance detection of distant human-object interactions.
Findings
Significant performance improvement on HICO-DET and V-COCO datasets.
Outperforms existing methods by a large margin.
Achieves new state-of-the-art results in HOI detection.
Abstract
Human-Object Interaction (HOI) detection has received considerable attention in the context of scene understanding. Despite the growing progress on benchmarks, we realize that existing methods often perform unsatisfactorily on distant interactions, where the leading causes are two-fold: 1) Distant interactions are by nature more difficult to recognize than close ones. A natural scene often involves multiple humans and objects with intricate spatial relations, making the interaction recognition for distant human-object largely affected by complex visual context. 2) Insufficient number of distant interactions in benchmark datasets results in under-fitting on these instances. To address these problems, in this paper, we propose a novel two-stage method for better handling distant interactions in HOI detection. One essential component in our method is a novel Far Near Distance Attention…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Advanced Image and Video Retrieval Techniques · Visual Attention and Saliency Detection
