How AI Forecasts AI Jobs: Benchmarking LLM Predictions of Labor Market Changes
Sheri Osborn, Rohit Valecha, H. Raghav Rao, Dan Sass, Anthony Rios

TL;DR
This paper introduces a benchmark to evaluate how well large language models can predict labor market changes due to AI, using datasets of job postings and occupational forecasts, and assesses different prompting strategies.
Contribution
It presents a new benchmark combining sector-level job data and AI-driven occupational forecasts to systematically evaluate LLMs' labor market prediction capabilities.
Findings
Structured prompts improve forecast stability
Persona prompts enhance short-term trend predictions
Performance varies across sectors and time horizons
Abstract
Artificial intelligence is reshaping labor markets, yet we lack tools to systematically forecast its effects on employment. This paper introduces a benchmark for evaluating how well large language models (LLMs) can anticipate changes in job demand, especially in occupations affected by AI. Existing research has shown that LLMs can extract sentiment, summarize economic reports, and emulate forecaster behavior, but little work has assessed their use for forward-looking labor prediction. Our benchmark combines two complementary datasets: a high-frequency index of sector-level job postings in the United States, and a global dataset of projected occupational changes due to AI adoption. We format these data into forecasting tasks with clear temporal splits, minimizing the risk of information leakage. We then evaluate LLMs using multiple prompting strategies, comparing task-scaffolded,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
