Enhancing Social Relation Inference with Concise Interaction Graph and   Discriminative Scene Representation

Xiaotian Yu; Hanling Yi; Yi Yu; Ling Xing; Shiliang Zhang; Xiaoyu Wang

arXiv:2107.14425·cs.CV·August 2, 2021

Enhancing Social Relation Inference with Concise Interaction Graph and Discriminative Scene Representation

Xiaotian Yu, Hanling Yi, Yi Yu, Ling Xing, Shiliang Zhang, Xiaoyu Wang

PDF

Open Access

TL;DR

This paper introduces PRISE, a new method that combines concise interaction graphs and discriminative scene features to improve social relation inference from images, outperforming existing approaches.

Contribution

The paper proposes a simple relational graph convolutional network and a contrastive learning task for holistic scene understanding, along with a large-scale dataset for social relation inference.

Findings

01

PRISE achieves 6.8% improvement in domain classification on PIPA.

02

The method outperforms state-of-the-art approaches.

03

The large-scale dataset enhances social relation inference.

Abstract

There has been a recent surge of research interest in attacking the problem of social relation inference based on images. Existing works classify social relations mainly by creating complicated graphs of human interactions, or learning the foreground and/or background information of persons and objects, but ignore holistic scene context. The holistic scene refers to the functionality of a place in images, such as dinning room, playground and office. In this paper, by mimicking human understanding on images, we propose an approach of \textbf{PR}actical \textbf{I}nference in \textbf{S}ocial r\textbf{E}lation (PRISE), which concisely learns interactive features of persons and discriminative features of holistic scenes. Technically, we develop a simple and fast relational graph convolutional network to capture interactive features of all persons in one image. To learn the holistic scene…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMultimodal Machine Learning Applications · Human Pose and Action Recognition · Domain Adaptation and Few-Shot Learning

MethodsContrastive Learning