Aesthetic Photo Collage with Deep Reinforcement Learning
Mingrui Zhang, Mading Li, Li Chen, Jiahao Yu

TL;DR
This paper introduces a deep reinforcement learning approach for automatic photo collage generation that optimizes aesthetic quality by modeling layout as a sequential decision process, overcoming data limitations with pretraining.
Contribution
It is the first to apply reinforcement learning to collage layout optimization and integrates a pretraining strategy with an attention fusion module for aesthetic assessment.
Findings
Outperforms existing methods in aesthetic quality evaluation.
Successfully models collage as a sequential decision process.
User studies confirm improved aesthetic appeal.
Abstract
Photo collage aims to automatically arrange multiple photos on a given canvas with high aesthetic quality. Existing methods are based mainly on handcrafted feature optimization, which cannot adequately capture high-level human aesthetic senses. Deep learning provides a promising way, but owing to the complexity of collage and lack of training data, a solution has yet to be found. In this paper, we propose a novel pipeline for automatic generation of aspect ratio specified collage and the reinforcement learning technique is introduced in collage for the first time. Inspired by manual collages, we model the collage generation as sequential decision process to adjust spatial positions, orientation angles, placement order and the global layout. To instruct the agent to improve both the overall layout and local details, the reward function is specially designed for collage, considering…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVisual Attention and Saliency Detection · Aesthetic Perception and Analysis · Olfactory and Sensory Function Studies
MethodsTest
