Aesthetic Attributes Assessment of Images
Xin Jin, Le Wu, Geng Zhao, Xiaodong Li, Xiaokun Zhang, Shiming Ge,, Dongqing Zou, Bin Zhou, Xinghui Zhou

TL;DR
This paper introduces a novel approach for assessing image aesthetic attributes by predicting descriptive captions and scores for multiple attributes, utilizing a new dataset and a transfer learning-based multi-attribute network.
Contribution
It proposes a new aesthetic attributes captioning framework, a novel dataset, and a multi-attribute network that jointly predicts captions and scores, advancing image aesthetic assessment methods.
Findings
Our method accurately predicts aesthetic attribute captions and scores.
The proposed AMAN outperforms traditional CNN-LSTM and SCA-CNN models.
Experimental results validate the effectiveness of transfer learning in this task.
Abstract
Image aesthetic quality assessment has been a relatively hot topic during the last decade. Most recently, comments type assessment (aesthetic captions) has been proposed to describe the general aesthetic impression of an image using text. In this paper, we propose Aesthetic Attributes Assessment of Images, which means the aesthetic attributes captioning. This is a new formula of image aesthetic assessment, which predicts aesthetic attributes captions together with the aesthetic score of each attribute. We introduce a new dataset named \emph{DPC-Captions} which contains comments of up to 5 aesthetic attributes of one image through knowledge transfer from a full-annotated small-scale dataset. Then, we propose Aesthetic Multi-Attribute Network (AMAN), which is trained on a mixture of fully-annotated small-scale PCCD dataset and weakly-annotated large-scale DPC-Captions dataset. Our AMAN…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVisual Attention and Saliency Detection · Multimodal Machine Learning Applications · Advanced Image and Video Retrieval Techniques
MethodsSpatial and Channel-wise Attention-based Convolutional Neural Network
