TL;DR
This paper introduces a deep learning approach that maps images into a high-dimensional aesthetic space to predict visual appeal, leveraging large-scale user data for fine-grained aesthetic assessment and achieving state-of-the-art results.
Contribution
It presents a novel high-dimensional aesthetic space learned via deep learning, incorporating diverse user preferences without handcrafted features, and demonstrates practical applications.
Findings
Achieved state-of-the-art accuracy on AVA benchmark.
Successfully modeled multi-user aesthetic preferences.
Enabled applications like photo sorting and key-frame extraction.
Abstract
Rating how aesthetically pleasing an image appears is a highly complex matter and depends on a large number of different visual factors. Previous work has tackled the aesthetic rating problem by ranking on a 1-dimensional rating scale, e.g., incorporating handcrafted attributes. In this paper, we propose a rather general approach to automatically map aesthetic pleasingness with all its complexity into an "aesthetic space" to allow for a highly fine-grained resolution. In detail, making use of deep learning, our method directly learns an encoding of a given image into this high-dimensional feature space resembling visual aesthetics. Additionally to the mentioned visual factors, differences in personal judgments have a large impact on the likeableness of a photograph. Nowadays, online platforms allow users to "like" or favor certain content with a single click. To incorporate a huge…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
