TL;DR
This paper introduces a multi-task attention network that predicts both the mean aesthetic score and user disagreement, improving image aesthetic ranking by accounting for subjective opinion variability.
Contribution
It proposes a novel end-to-end model and confidence interval ranking loss to better capture user disagreement in aesthetic quality prediction.
Findings
Achieves state-of-the-art results on AVA and TMGA datasets.
Effectively models user disagreement and uncertainty in aesthetic scoring.
Improves ranking robustness by considering confidence intervals.
Abstract
How to robustly rank the aesthetic quality of given images has been a long-standing ill-posed topic. Such challenge stems mainly from the diverse subjective opinions of different observers about the varied types of content. There is a growing interest in estimating the user agreement by considering the standard deviation of the scores, instead of only predicting the mean aesthetic opinion score. Nevertheless, when comparing a pair of contents, few studies consider how confident are we regarding the difference in the aesthetic scores. In this paper, we thus propose (1) a re-adapted multi-task attention network to predict both the mean opinion score and the standard deviation in an end-to-end manner; (2) a brand-new confidence interval ranking loss that encourages the model to focus on image-pairs that are less certain about the difference of their aesthetic scores. With such loss, the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
