DeepPresenter: Environment-Grounded Reflection for Agentic Presentation Generation

Hao Zheng; Guozhao Mo; Xinru Yan; Qianhao Yuan; Wenkai Zhang; Xuanang Chen; Yaojie Lu; Hongyu Lin; Xianpei Han; and Le Sun

arXiv:2602.22839·cs.AI·April 21, 2026

DeepPresenter: Environment-Grounded Reflection for Agentic Presentation Generation

Hao Zheng, Guozhao Mo, Xinru Yan, Qianhao Yuan, Wenkai Zhang, Xuanang Chen, Yaojie Lu, Hongyu Lin, Xianpei Han, and Le Sun

PDF

1 Repo 2 Models 1 Datasets

TL;DR

DeepPresenter is an environment-grounded framework for autonomous presentation generation that adapts to user needs, refines content iteratively, and corrects issues during execution, achieving state-of-the-art results.

Contribution

It introduces an agentic, environment-grounded approach that enables flexible, feedback-driven presentation creation beyond fixed templates and scripted pipelines.

Findings

01

Achieves state-of-the-art performance on diverse presentation tasks.

02

Fine-tuned 9B model remains highly competitive at lower cost.

03

Supports long-horizon refinement through environmental observations.

Abstract

Presentation generation requires deep content research, coherent visual design, and iterative refinement based on observation. However, existing presentation agents often rely on predefined workflows and fixed templates. To address this, we present DeepPresenter, an agentic framework that adapts to diverse user intents, enables effective feedback-driven refinement, and generalizes beyond a scripted pipeline. Specifically, DeepPresenter autonomously plans, renders, and revises intermediate slide artifacts to support long-horizon refinement with environmental observations. Furthermore, rather than relying on self-reflection over internal signals (e.g., reasoning traces), our environment-grounded reflection conditions the generation process on perceptual artifact states (e.g., rendered slides), enabling the system to identify and correct presentation-specific issues during execution.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

icip-cas/PPTAgent
github

Models

Datasets

Forceless/UltraPresent
dataset· 1.8k dl
1.8k dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.