Improving Generative Ad Text on Facebook using Reinforcement Learning
Daniel R. Jiang, Alex Nikulkov, Yu-Chia Chen, Yang Bai, Zheqing Zhu

TL;DR
This paper presents the first large-scale deployment of reinforcement learning to improve generative ad text on Facebook, demonstrating a 6.7% increase in click-through rates and higher advertiser satisfaction.
Contribution
It introduces RLPF, a novel reinforcement learning post-training method using real-world performance data, and provides the first large-scale empirical evaluation in an ecologically valid setting.
Findings
AdLlama increased click-through rates by 6.7%.
Advertisers generated more ad variations with AdLlama.
RLPF proved effective and generalizable for metric-driven post-training.
Abstract
Generative artificial intelligence (AI), in particular large language models (LLMs), is poised to drive transformative economic change. LLMs are pre-trained on vast text data to learn general language patterns, but a subsequent post-training phase is critical to align them for specific real-world tasks. Reinforcement learning (RL) is the leading post-training technique, yet its economic impact remains largely underexplored and unquantified. We examine this question through the lens of the first deployment of an RL-trained LLM for generative advertising on Facebook. Integrated into Meta's Text Generation feature, our model, "AdLlama," powers an AI tool that helps advertisers create new variations of human-written ad text. To train this model, we introduce reinforcement learning with performance feedback (RLPF), a post-training method that uses historical ad performance data as a reward…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
