Portrait Interpretation and a Benchmark

Yixuan Fan; Zhaopeng Dou; Yali Li; Shengjin Wang

arXiv:2207.13315·cs.CV·July 28, 2022

Portrait Interpretation and a Benchmark

Yixuan Fan, Zhaopeng Dou, Yali Li, Shengjin Wang

PDF

Open Access

TL;DR

This paper introduces Portrait Interpretation, a new multi-task learning framework with a large dataset, aiming to comprehensively perceive human portraits by integrating appearance, posture, and emotion recognition.

Contribution

It defines a novel portrait interpretation task, constructs a large diverse dataset, and proposes a baseline model and metric for systematic perception of portraits.

Findings

01

Multi-task learning improves portrait perception accuracy.

02

The dataset covers diverse human attributes from movies.

03

Combining related tasks benefits overall performance.

Abstract

We propose a task we name Portrait Interpretation and construct a dataset named Portrait250K for it. Current researches on portraits such as human attribute recognition and person re-identification have achieved many successes, but generally, they: 1) may lack mining the interrelationship between various tasks and the possible benefits it may bring; 2) design deep models specifically for each task, which is inefficient; 3) may be unable to cope with the needs of a unified model and comprehensive perception in actual scenes. In this paper, the proposed portrait interpretation recognizes the perception of humans from a new systematic perspective. We divide the perception of portraits into three aspects, namely Appearance, Posture, and Emotion, and design corresponding sub-tasks for each aspect. Based on the framework of multi-task learning, portrait interpretation requires a comprehensive…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHuman Pose and Action Recognition · Face recognition and analysis · Multimodal Machine Learning Applications