AeSlides: Incentivizing Aesthetic Layout in LLM-Based Slide Generation via Verifiable Rewards

Yiming Pan; Chengwei Hu; Xuancheng Huang; Can Huang; Mingming Zhao; Yuean Bi; Xiaohan Zhang; Aohan Zeng; Linmei Hu

arXiv:2604.22840·cs.CV·April 28, 2026

AeSlides: Incentivizing Aesthetic Layout in LLM-Based Slide Generation via Verifiable Rewards

Yiming Pan, Chengwei Hu, Xuancheng Huang, Can Huang, Mingming Zhao, Yuean Bi, Xiaohan Zhang, Aohan Zeng, Linmei Hu

PDF

1 Repo 1 Datasets

TL;DR

AeSlides introduces a reinforcement learning framework with verifiable aesthetic metrics to improve the visual layout quality of slides generated by large language models, achieving significant aesthetic improvements with minimal training data.

Contribution

The paper presents a novel reinforcement learning approach using verifiable aesthetic metrics to directly optimize slide layout quality in LLM-based slide generation.

Findings

01

Improved aspect ratio compliance from 36% to 85%.

02

Reduced whitespace by 44% and element collisions by 43%.

03

Human evaluation scores increased by 7.6%.

Abstract

Large language models (LLMs) have demonstrated strong potential in agentic tasks, particularly in slide generation. However, slide generation poses a fundamental challenge: the generation process is text-centric, whereas its quality is governed by visual aesthetics. This modality gap leads current models to frequently produce slides with aesthetically suboptimal layouts. Existing solutions typically rely either on heavy visual reflection, which incurs high inference cost yet yields limited gains; or on fine-tuning with large-scale datasets, which still provides weak and indirect aesthetic supervision. In contrast, the explicit use of aesthetic principles as supervision remains unexplored. In this work, we present AeSlides, a reinforcement learning framework with verifiable rewards for Aesthetic layout supervision in Slide generation. We introduce a suite of meticulously designed…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ympan0508/aeslides
github

Datasets

ympan/aeslides-reward-bench
dataset· 1.9k dl
1.9k dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.