Engineering Deep Representations for Modeling Aesthetic Perception

Yanxiang Chen; Yuxing Hu; Luming Zhang; Ping Li; and Chao Zhang

arXiv:1605.07699·cs.CV·August 9, 2019

Engineering Deep Representations for Modeling Aesthetic Perception

Yanxiang Chen, Yuxing Hu, Luming Zhang, Ping Li, and Chao Zhang

PDF

Open Access

TL;DR

This paper introduces a deep learning framework that automatically learns localized aesthetic features from images using weak supervision, improving interpretability and performance in aesthetic tasks.

Contribution

The work develops a novel weakly-supervised deep architecture that learns region-level aesthetic attributes from Flickr images, addressing interpretability and adaptability issues in aesthetic modeling.

Findings

01

Effective localization of aesthetic regions in images.

02

Improved performance in aesthetic ranking and retrieval.

03

Demonstrated competitiveness through subjective and objective tests.

Abstract

Many aesthetic models in computer vision suffer from two shortcomings: 1) the low descriptiveness and interpretability of those hand-crafted aesthetic criteria (i.e., nonindicative of region-level aesthetics), and 2) the difficulty of engineering aesthetic features adaptively and automatically toward different image sets. To remedy these problems, we develop a deep architecture to learn aesthetically-relevant visual attributes from Flickr1, which are localized by multiple textual attributes in a weakly-supervised setting. More specifically, using a bag-ofwords (BoW) representation of the frequent Flickr image tags, a sparsity-constrained subspace algorithm discovers a compact set of textual attributes (e.g., landscape and sunset) for each image. Then, a weakly-supervised learning algorithm projects the textual attributes at image-level to the highly-responsive image patches at…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsVisual Attention and Saliency Detection · Aesthetic Perception and Analysis · Image Retrieval and Classification Techniques

MethodsInterpretability