Misspecified Model Estimation and Its Impact on Predictions
Junnan He, Lin Hu, Matthew Kovach, and Anqi Li

TL;DR
This paper examines how misspecification of population coefficients in a linear model affects prediction accuracy, especially under certain conditions like residual information and alignment with latent factors.
Contribution
It provides a theoretical analysis of prediction distortions caused by coefficient misspecification in a linear model with applications to bias and consumer research.
Findings
Misspecification leads to systematic prediction errors.
Residual information influences the extent of prediction distortion.
Alignment between misspecification and latent-to-coefficient mapping affects bias.
Abstract
We study a linear statistical model where outcomes depend on regressors with fixed population coefficients and observation-specific latent coefficients, along with measurement errors. A decision-maker estimates population coefficients and uses the estimates to predict the latent coefficients for a given observation. We analyze how misspecification of some population coefficients distorts predictions, investigating comparative statics with respect to: (1) residual information in regressors associated with misspecified coefficients after projecting out those associated with free coefficients, (2) alignment between misspecification vector and latent-to-coefficient mapping. Applications include employee rating with unconscious bias and LLM-mediated consumer research.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
