When Can We Trust Cluster-Robust Inference?

James G. MacKinnon

arXiv:2604.02000·econ.EM·April 3, 2026

When Can We Trust Cluster-Robust Inference?

James G. MacKinnon

PDF

TL;DR

This paper reviews various methods for cluster-robust inference in regression models, highlighting their strengths and limitations, and suggests using multiple procedures to assess the reliability of statistical inferences.

Contribution

It provides a comprehensive overview of different variance estimators and inference methods, guiding practitioners on assessing their trustworthiness in specific datasets.

Findings

01

Some methods outperform others in certain scenarios

02

Using multiple procedures improves confidence in inference reliability

03

No single method is universally reliable for all cases

Abstract

It is common when using cross-section or panel data to assign each observation to a cluster and allow for arbitrary patterns of heteroskedasticity and correlation within clusters. For regression models, there are many ways to make cluster-robust inferences. A number of different variance matrix estimators can be used. Hypothesis tests and confidence intervals can then be based on several alternative analytic or bootstrap distributions. Some methods typically perform much better than others, but no method yields reliable inferences in every case. Thus it can be hard to know which $P$ values and confidence intervals to trust. Nevertheless, by using a number of procedures to assess the reliability of various inferential methods for a specific model and dataset, we can often obtain results in which we may be reasonably confident.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.