Prompting for Policy: Forecasting Macroeconomic Scenarios with Synthetic LLM Personas
Giulia Iadisernia, Carolina Camassa

TL;DR
This study assesses whether persona-based prompting enhances GPT-4o's macroeconomic forecasts, finding that such prompts do not significantly improve accuracy and that GPT-4o performs comparably to human experts across multiple economic indicators.
Contribution
It demonstrates that persona prompts do not significantly affect forecasting accuracy, and GPT-4o can reliably predict macroeconomic variables without complex prompt engineering.
Findings
GPT-4o matches human forecast accuracy.
Persona prompts do not improve predictions.
GPT-4o maintains performance on unseen data.
Abstract
We evaluate whether persona-based prompting improves Large Language Model (LLM) performance on macroeconomic forecasting tasks. Using 2,368 economics-related personas from the PersonaHub corpus, we prompt GPT-4o to replicate the ECB Survey of Professional Forecasters across 50 quarterly rounds (2013-2025). We compare the persona-prompted forecasts against the human experts panel, across four target variables (HICP, core HICP, GDP growth, unemployment) and four forecast horizons. We also compare the results against 100 baseline forecasts without persona descriptions to isolate its effect. We report two main findings. Firstly, GPT-4o and human forecasters achieve remarkably similar accuracy levels, with differences that are statistically significant yet practically modest. Our out-of-sample evaluation on 2024-2025 data demonstrates that GPT-4o can maintain competitive forecasting…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPersona Design and Applications · Text Readability and Simplification · Digital Economy and Work Transformation
