A2-RL: Aesthetics Aware Reinforcement Learning for Image Cropping
Debang Li, Huikai Wu, Junge Zhang, Kaiqi Huang

TL;DR
This paper introduces A2-RL, a reinforcement learning framework for image cropping that models the task as a sequential decision process, improving efficiency and aesthetic quality without bounding box supervision.
Contribution
It formulates aesthetic image cropping as a sequential decision-making problem and develops an aesthetics-aware reward function within a reinforcement learning framework.
Findings
Achieves state-of-the-art performance on unseen cropping datasets.
Uses fewer candidate windows and less computation time.
Outperforms previous weakly supervised methods.
Abstract
Image cropping aims at improving the aesthetic quality of images by adjusting their composition. Most weakly supervised cropping methods (without bounding box supervision) rely on the sliding window mechanism. The sliding window mechanism requires fixed aspect ratios and limits the cropping region with arbitrary size. Moreover, the sliding window method usually produces tens of thousands of windows on the input image which is very time-consuming. Motivated by these challenges, we firstly formulate the aesthetic image cropping as a sequential decision-making process and propose a weakly supervised Aesthetics Aware Reinforcement Learning (A2-RL) framework to address this problem. Particularly, the proposed method develops an aesthetics aware reward function which especially benefits image cropping. Similar to human's decision making, we use a comprehensive state representation including…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVisual Attention and Saliency Detection · Olfactory and Sensory Function Studies · Image and Video Quality Assessment
