Ecological fallacy and covariates: new insights based on multilevel modelling of individual data
Michela Gnaldi, Venera Tomaselli, Antonio Forcina

TL;DR
This paper examines ecological bias in ecological inference, demonstrating that unbiased estimates require specific conditions, and shows that models incorporating covariates are essential when these conditions are violated, supported by empirical analysis of voting data.
Contribution
It provides explicit conditions for unbiased ecological regression and highlights the importance of covariate modeling to correct ecological bias, supported by multilevel logistic regression analysis.
Findings
Ecological bias occurs when conditions are violated.
Covariate-inclusive models can produce unbiased estimates.
Weak association between variables limits unbiased inference.
Abstract
This paper deals with the issue of ecological bias in ecological inference. We provide an explicit formulation of the conditions required for the ordinary ecological regression to produce unbiased estimates and argue that, when these conditions are violated, any method of ecological inference is going to produce biased estimates. These findings are clarified and supported by empirical evidence provided by comparing the results of three main ecological inference methods with those of multilevel logistic regression applied to a unique set of individual data on voting behaviour. The main findings of our study have two important implications that apply to all situations where the conditions for no ecological bias are violated: (i) only ecological inference methods that allow to model the effect of covariates have a chance to produce unbiased estimates; (ii) the set of covariates to be…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
