Aesthetic Attributes Assessment of Images

Xin Jin; Le Wu; Geng Zhao; Xiaodong Li; Xiaokun Zhang; Shiming Ge,; Dongqing Zou; Bin Zhou; Xinghui Zhou

arXiv:1907.04983·cs.CV·July 30, 2019·1 cites

Aesthetic Attributes Assessment of Images

Xin Jin, Le Wu, Geng Zhao, Xiaodong Li, Xiaokun Zhang, Shiming Ge,, Dongqing Zou, Bin Zhou, Xinghui Zhou

PDF

Open Access 2 Repos

TL;DR

This paper introduces a novel approach for assessing image aesthetic attributes by predicting descriptive captions and scores for multiple attributes, utilizing a new dataset and a transfer learning-based multi-attribute network.

Contribution

It proposes a new aesthetic attributes captioning framework, a novel dataset, and a multi-attribute network that jointly predicts captions and scores, advancing image aesthetic assessment methods.

Findings

01

Our method accurately predicts aesthetic attribute captions and scores.

02

The proposed AMAN outperforms traditional CNN-LSTM and SCA-CNN models.

03

Experimental results validate the effectiveness of transfer learning in this task.

Abstract

Image aesthetic quality assessment has been a relatively hot topic during the last decade. Most recently, comments type assessment (aesthetic captions) has been proposed to describe the general aesthetic impression of an image using text. In this paper, we propose Aesthetic Attributes Assessment of Images, which means the aesthetic attributes captioning. This is a new formula of image aesthetic assessment, which predicts aesthetic attributes captions together with the aesthetic score of each attribute. We introduce a new dataset named \emph{DPC-Captions} which contains comments of up to 5 aesthetic attributes of one image through knowledge transfer from a full-annotated small-scale dataset. Then, we propose Aesthetic Multi-Attribute Network (AMAN), which is trained on a mixture of fully-annotated small-scale PCCD dataset and weakly-annotated large-scale DPC-Captions dataset. Our AMAN…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsVisual Attention and Saliency Detection · Multimodal Machine Learning Applications · Advanced Image and Video Retrieval Techniques

MethodsSpatial and Channel-wise Attention-based Convolutional Neural Network