Generative Augmented Inference
Cheng Lu, Mengxin Wang, Dennis J. Zhang, Heng Zhang

TL;DR
Generative Augmented Inference (GAI) is a flexible framework that leverages AI outputs as auxiliary features to improve estimation of human-labeled outcomes, offering efficiency gains and robust inference.
Contribution
GAI introduces an orthogonal moment construction enabling consistent, efficient, and valid inference with AI-generated auxiliary signals, even when their relationship to true labels is complex or unknown.
Findings
GAI reduces estimation error by about 50% in conjoint analysis.
GAI lowers human labeling needs by over 75% in certain tasks.
GAI improves confidence interval coverage without increasing width.
Abstract
Data-driven operations management often relies on parameters estimated from costly human-generated labels. Recent advances in large language models (LLMs) and other AI systems offer inexpensive auxiliary data, but introduce a new challenge: AI outputs are not direct observations of the target outcomes, but could involve high-dimensional representations with complex and unknown relationships to human labels. Conventional methods leverage AI predictions as direct proxies for true labels, which can be inefficient or unreliable when this relationship is weak or misspecified. We propose Generative Augmented Inference (GAI), a general framework that incorporates AI-generated outputs as informative features for estimating models of human-labeled outcomes. GAI uses an orthogonal moment construction that enables consistent estimation and valid inference with flexible, nonparametric relationship…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
