Crashing Waves vs. Rising Tides: Preliminary Findings on AI Automation from Thousands of Worker Evaluations of Labor Market Tasks
Matthias Mertens, Adam Kuzee, Brittany S. Harris, Harry Lyu, Wensu Li, Jonathan Rosenfeld, Meiri Anto, Martin Fleming, Neil Thompson

TL;DR
This paper analyzes the progression of AI automation, finding evidence for a gradual 'rising tide' increase in capabilities across many tasks, rather than abrupt 'crashing waves' surges, based on evaluations of over 3,000 labor market tasks.
Contribution
It provides empirical evidence that AI capabilities are improving steadily across a broad range of tasks, challenging the notion of sudden 'waves' of automation.
Findings
AI models currently complete tasks in 3-4 hours with ~50% success rate in 2024-Q2.
Success rates are projected to reach 80-95% by 2029 if current growth trends continue.
Little evidence was found for abrupt 'crashing waves' in AI capability growth.
Abstract
We propose that AI automation is a continuum between: (i) crashing waves where AI capabilities surge abruptly over small sets of tasks, and (ii) rising tides where the increase in AI capabilities is more continuous and broad-based. We test for these effects in preliminary evidence from an ongoing evaluation of AI capabilities across over 3,000 broad-based tasks derived from the U.S. Department of Labor O*NET categorization that are text-based and thus LLM-addressable. Based on more than 17,000 evaluations by workers from these jobs, we find little evidence of crashing waves (in contrast to recent work by METR), but substantial evidence that rising tides are the primary form of AI automation. AI performance is high and improving rapidly across a wide range of tasks. We estimate that, in 2024-Q2, AI models successfully complete tasks that take humans approximately 3-4 hours with about a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
