Cross Sectional Regression with Cluster Dependence: Inference based on Averaging
Subhodeep Dey, Gopal K. Basak, Samarjit Das

TL;DR
This paper examines the limitations of traditional pooled OLS estimators under cluster dependence and introduces a data averaging estimator that is more consistent and efficient, especially in complex cluster scenarios.
Contribution
The paper proposes a new averaging-based estimator for cross-sectional regression with cluster dependence, demonstrating its consistency and efficiency over traditional methods.
Findings
The averaging estimator is consistent even when pooled OLS is not.
The new estimator outperforms pooled OLS in efficiency and goodness of fit.
Simulation results confirm the estimator's practical advantages.
Abstract
We re-investigate the asymptotic properties of the traditional OLS (pooled) estimator, , in the context of cluster dependence. The present study considers various scenarios under various restrictions on the cluster sizes and number of clusters. It is shown that could be inconsistent in many realistic situations. We propose a simple estimator, based on data averaging. The asymptotic properties of are studied. It is shown that is consistent even when is inconsistent. It is further shown that the proposed estimator is more efficient than in many practical scenarios. As a consequence of averaging, we show that retains consistency, asymptotic normality under classical measurement error problem circumventing the use of Instrumental Variables (IV). A…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Statistical Methods and Models
