Enhancing Social Relation Inference with Concise Interaction Graph and Discriminative Scene Representation
Xiaotian Yu, Hanling Yi, Yi Yu, Ling Xing, Shiliang Zhang, Xiaoyu Wang

TL;DR
This paper introduces PRISE, a new method that combines concise interaction graphs and discriminative scene features to improve social relation inference from images, outperforming existing approaches.
Contribution
The paper proposes a simple relational graph convolutional network and a contrastive learning task for holistic scene understanding, along with a large-scale dataset for social relation inference.
Findings
PRISE achieves 6.8% improvement in domain classification on PIPA.
The method outperforms state-of-the-art approaches.
The large-scale dataset enhances social relation inference.
Abstract
There has been a recent surge of research interest in attacking the problem of social relation inference based on images. Existing works classify social relations mainly by creating complicated graphs of human interactions, or learning the foreground and/or background information of persons and objects, but ignore holistic scene context. The holistic scene refers to the functionality of a place in images, such as dinning room, playground and office. In this paper, by mimicking human understanding on images, we propose an approach of \textbf{PR}actical \textbf{I}nference in \textbf{S}ocial r\textbf{E}lation (PRISE), which concisely learns interactive features of persons and discriminative features of holistic scenes. Technically, we develop a simple and fast relational graph convolutional network to capture interactive features of all persons in one image. To learn the holistic scene…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Human Pose and Action Recognition · Domain Adaptation and Few-Shot Learning
MethodsContrastive Learning
