Portrait Interpretation and a Benchmark
Yixuan Fan, Zhaopeng Dou, Yali Li, Shengjin Wang

TL;DR
This paper introduces Portrait Interpretation, a new multi-task learning framework with a large dataset, aiming to comprehensively perceive human portraits by integrating appearance, posture, and emotion recognition.
Contribution
It defines a novel portrait interpretation task, constructs a large diverse dataset, and proposes a baseline model and metric for systematic perception of portraits.
Findings
Multi-task learning improves portrait perception accuracy.
The dataset covers diverse human attributes from movies.
Combining related tasks benefits overall performance.
Abstract
We propose a task we name Portrait Interpretation and construct a dataset named Portrait250K for it. Current researches on portraits such as human attribute recognition and person re-identification have achieved many successes, but generally, they: 1) may lack mining the interrelationship between various tasks and the possible benefits it may bring; 2) design deep models specifically for each task, which is inefficient; 3) may be unable to cope with the needs of a unified model and comprehensive perception in actual scenes. In this paper, the proposed portrait interpretation recognizes the perception of humans from a new systematic perspective. We divide the perception of portraits into three aspects, namely Appearance, Posture, and Emotion, and design corresponding sub-tasks for each aspect. Based on the framework of multi-task learning, portrait interpretation requires a comprehensive…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Face recognition and analysis · Multimodal Machine Learning Applications
