Mining Conditional Part Semantics with Occluded Extrapolation for Human-Object Interaction Detection
Guangzhi Wang, Yangyang Guo, Mohan Kankanhalli

TL;DR
This paper introduces a novel Part Semantic Network with a Conditional Part Attention mechanism and an Occluded Part Extrapolation strategy to improve human-object interaction detection, especially under occlusion, without relying on external annotations.
Contribution
The paper proposes a new Part Semantic Network with CPA and OPE strategies, enabling better interaction recognition and occlusion handling without external data.
Findings
Outperforms prior methods on V-COCO and HICO-DET datasets
Effectively recognizes subtle human-object interactions
Handles occlusion scenarios with extrapolation
Abstract
Human-Object Interaction Detection is a crucial aspect of human-centric scene understanding, with important applications in various domains. Despite recent progress in this field, recognizing subtle and detailed interactions remains challenging. Existing methods try to use human-related clues to alleviate the difficulty, but rely heavily on external annotations or knowledge, limiting their practical applicability in real-world scenarios. In this work, we propose a novel Part Semantic Network (PSN) to solve this problem. The core of PSN is a Conditional Part Attention (CPA) mechanism, where human features are taken as keys and values, and the object feature is used as query for the computation in a cross-attention mechanism. In this way, our model learns to automatically focus on the most informative human parts conditioned on the involved object, generating more semantically meaningful…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Human Pose and Action Recognition · Advanced Neural Network Applications
MethodsFocus
