Improving Generative Ad Text on Facebook using Reinforcement Learning

Daniel R. Jiang; Alex Nikulkov; Yu-Chia Chen; Yang Bai; Zheqing Zhu

arXiv:2507.21983 · cs.LG · December 16, 2025

TL;DR

This paper reports the first deployment of a reinforcement-learning-trained LLM for generative ad text on Facebook. The resulting model, AdLlama, raised click-through rates by 6.7%, and advertisers using it generated more ad variations, which is read as a sign of higher advertiser satisfaction.

Contribution

It introduces reinforcement learning with performance feedback (RLPF), a post-training method that uses historical ad performance data as the reward signal, and provides the first large-scale empirical evaluation of such a method in an ecologically valid setting.
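
The idea of using historical performance as a reward can be illustrated with a toy sketch. The abstract only states that RLPF uses historical ad performance data as a reward; everything else here is an assumption made for illustration: the ad texts and counts in `HISTORY` are invented, `smoothed_ctr` is a hypothetical reward definition, and a softmax policy over a fixed candidate list stands in for an LLM. It is not the authors' training procedure.

```python
import math

# Hypothetical historical log: ad text -> (clicks, impressions). Invented numbers.
HISTORY = {
    "Shop our summer sale today": (30, 1000),
    "Summer sale: up to 50% off": (45, 1000),
    "New arrivals for summer": (20, 1000),
}


def smoothed_ctr(clicks, impressions, prior_clicks=1.0, prior_impressions=50.0):
    """Reward: click-through rate with additive smoothing (an assumed choice)."""
    return (clicks + prior_clicks) / (impressions + prior_impressions)


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def train_policy(history, steps=500, lr=50.0):
    """Gradient ascent on expected reward for a softmax policy over candidates."""
    texts = list(history)
    rewards = [smoothed_ctr(*history[t]) for t in texts]
    logits = [0.0] * len(texts)
    for _ in range(steps):
        probs = softmax(logits)
        # Expected reward under the current policy, used as the baseline.
        baseline = sum(p * r for p, r in zip(probs, rewards))
        # Exact policy gradient for a softmax policy:
        # d E[r] / d logit_i = p_i * (r_i - baseline)
        logits = [
            l + lr * p * (r - baseline)
            for l, p, r in zip(logits, probs, rewards)
        ]
    return dict(zip(texts, softmax(logits)))


policy = train_policy(HISTORY)
best_text = max(policy, key=policy.get)
print(best_text)
```

In this toy setting the policy shifts probability toward the candidate with the highest historical click-through rate. A real system would have to score text it has never served, which is why a learned reward model would be needed in place of the lookup table used here.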

Findings

01. AdLlama increased click-through rates by 6.7%.

02. Advertisers generated more ad variations with AdLlama.

03. RLPF proved effective and generalizable for metric-driven post-training.
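
To make the headline number concrete, the sketch below computes a relative click-through-rate lift. It assumes the 6.7% figure is a relative improvement of a treatment arm over a control arm. The click and impression counts are made-up placeholders chosen to reproduce that ratio, not data from the paper, and `ctr` and `relative_lift` are hypothetical helper names.

```python
def ctr(clicks: int, impressions: int) -> float:
    """Click-through rate: clicks divided by impressions."""
    if impressions <= 0:
        raise ValueError("impressions must be positive")
    return clicks / impressions


def relative_lift(treatment_ctr: float, control_ctr: float) -> float:
    """Relative change of the treatment CTR over the control CTR."""
    if control_ctr <= 0:
        raise ValueError("control_ctr must be positive")
    return treatment_ctr / control_ctr - 1.0


# Placeholder counts (NOT from the paper), picked so the lift comes out at 6.7%.
control = ctr(clicks=3_000, impressions=100_000)
treatment = ctr(clicks=3_201, impressions=100_000)
lift = relative_lift(treatment, control)
print(f"relative CTR lift: {lift:.1%}")
```

With these placeholders, a 3.000% control CTR becomes 3.201%, so a 6.7% relative lift corresponds to roughly 0.2 percentage points in absolute terms.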

Abstract

Generative artificial intelligence (AI), in particular large language models (LLMs), is poised to drive transformative economic change. LLMs are pre-trained on vast text data to learn general language patterns, but a subsequent post-training phase is critical to align them for specific real-world tasks. Reinforcement learning (RL) is the leading post-training technique, yet its economic impact remains largely underexplored and unquantified. We examine this question through the lens of the first deployment of an RL-trained LLM for generative advertising on Facebook. Integrated into Meta's Text Generation feature, our model, "AdLlama," powers an AI tool that helps advertisers create new variations of human-written ad text. To train this model, we introduce reinforcement learning with performance feedback (RLPF), a post-training method that uses historical ad performance data as a reward…
