Measuring What Cannot Be Surveyed: LLMs as Instruments for Latent Cognitive Variables in Labor Economics
Cristian Espinal Maya

TL;DR
This paper develops a framework for using Large Language Models as measurement tools for latent economic variables, validated through a new occupational index with strong validity and reliability.
Contribution
It formalizes conditions for LLM-based instruments, constructs a novel occupational index, and demonstrates its validity and robustness in labor economics.
Findings
The occupational index shows high convergent and discriminant validity.
Inter-rater reliability between LLM models is strong (r=0.76).
ORIV estimates are 25% larger than OLS, indicating measurement error correction.
Abstract
This paper establishes the theoretical and practical foundations for using Large Language Models (LLMs) as measurement instruments for latent economic variables -- specifically variables that describe the cognitive content of occupational tasks at a level of granularity not achievable with existing survey instruments. I formalize four conditions under which LLM-generated scores constitute valid instruments: semantic exogeneity, construct relevance, monotonicity, and model invariance. I then apply this framework to the Augmented Human Capital Index (AHC_o), constructed from 18,796 O*NET task statements scored by Claude Haiku 4.5, and validated against six existing AI exposure indices. The index shows strong convergent validity (r = 0.85 with Eloundou GPT-gamma, r = 0.79 with Felten AIOE) and discriminant validity. Principal component analysis confirms that AI-related occupational…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
