Engineering Deep Representations for Modeling Aesthetic Perception
Yanxiang Chen, Yuxing Hu, Luming Zhang, Ping Li, and Chao Zhang

TL;DR
This paper introduces a deep learning framework that automatically learns localized aesthetic features from images using weak supervision, improving interpretability and performance in aesthetic tasks.
Contribution
The work develops a novel weakly-supervised deep architecture that learns region-level aesthetic attributes from Flickr images, addressing interpretability and adaptability issues in aesthetic modeling.
Findings
Effective localization of aesthetic regions in images.
Improved performance in aesthetic ranking and retrieval.
Demonstrated competitiveness through subjective and objective tests.
Abstract
Many aesthetic models in computer vision suffer from two shortcomings: 1) the low descriptiveness and interpretability of those hand-crafted aesthetic criteria (i.e., nonindicative of region-level aesthetics), and 2) the difficulty of engineering aesthetic features adaptively and automatically toward different image sets. To remedy these problems, we develop a deep architecture to learn aesthetically-relevant visual attributes from Flickr1, which are localized by multiple textual attributes in a weakly-supervised setting. More specifically, using a bag-ofwords (BoW) representation of the frequent Flickr image tags, a sparsity-constrained subspace algorithm discovers a compact set of textual attributes (e.g., landscape and sunset) for each image. Then, a weakly-supervised learning algorithm projects the textual attributes at image-level to the highly-responsive image patches at…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVisual Attention and Saliency Detection · Aesthetic Perception and Analysis · Image Retrieval and Classification Techniques
MethodsInterpretability
