(Ab)Using Regression for Data Adjustment
Lutz Duembgen

TL;DR
The paper critically examines the use of regression residuals for data adjustment in economic performance comparisons, highlighting potential pitfalls and interpretational issues of this common approach.
Contribution
It provides a detailed analysis of the limitations of using regression residuals for data adjustment and discusses alternative methods.
Findings
Residuals only capture part of the adjusted performance
Estimated regression functions can obscure true performance differences
Potential pitfalls can lead to misinterpretation of adjusted data
Abstract
In various economic applications, people want to compare units with respect to certain quantities measuring their performance. The latter, however, is often influenced by certain factors which are beyond control of the units, and one would like to extract an adjusted performance from the data. Specifically, let summarize the factors of the -th unit. Then one could think of a model equation with a regression function describing the unavoidable influence of the factors and being the adjusted performance of the -th unit. Now a common proposal is to estimate via regression methods by a function depending on the current data , possibly augmented by additional past data, and to use the residuals $\hat{\epsilon}_i := Y_i -…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Statistical Methods and Models
