Balancing Flexibility and Interpretability: A Conditional Linear Model Estimation via Random Forest
Ricardo Masini, Marcelo Medeiros

TL;DR
This paper introduces a novel method combining random forests with linear modeling to estimate heterogeneous effects, balancing interpretability and flexibility in econometric analysis.
Contribution
It develops a new approach to estimate conditional linear models with heterogeneity using random forests, unifying parametric and nonparametric methods with theoretical guarantees.
Findings
The method achieves favorable finite-sample performance in simulations.
It provides consistent estimation and inference for heterogeneous effects.
The approach generalizes several existing models like varying-coefficient and additive models.
Abstract
Traditional parametric econometric models often rely on rigid functional forms, while nonparametric techniques, despite their flexibility, frequently lack interpretability. This paper proposes a parsimonious alternative by modeling the outcome as a linear function of a vector of variables of interest , conditional on additional covariates . Specifically, the conditional expectation is expressed as , where is an unknown Lipschitz-continuous function. We introduce an adaptation of the Random Forest (RF) algorithm to estimate this model, balancing the flexibility of machine learning methods with the interpretability of traditional linear models. This approach addresses a key challenge in applied econometrics by accommodating…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification
