Permutation Inference under Multi-way Clustering and Missing Data
Wenxuan Guo, Panos Toulis, Yuhao Wang

TL;DR
This paper introduces a finite-sample valid permutation inference framework for linear regression with multi-way clustering, addressing issues of small clusters and missing data, and demonstrating improved reliability over traditional methods.
Contribution
It develops a permutation testing approach under minimal assumptions that is valid in finite samples and extends it to handle missing data, with theoretical power guarantees.
Findings
Permutation tests maintain correct size in small samples.
Proposed methods outperform standard procedures in simulations.
Power analysis reveals phase transitions in detectability.
Abstract
Econometric applications with multi-way clustering often feature a small number of effective clusters or heavy-tailed data, making standard cluster-robust and bootstrap inference unreliable in finite samples. In this paper, we develop a framework for finite-sample valid permutation inference in linear regression with multi-way clustering under an assumption of conditional exchangeability of the errors. Our assumption is closely related to the notion of separate exchangeability studied in earlier work, but can be more realistic in many economic settings as it imposes minimal restrictions on the covariate distribution. We construct permutation tests of significance that are valid in finite samples and establish theoretical power guarantees, in contrast to existing methods that are justified only asymptotically. We also extend our methodology to settings with missing data and derive power…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBayesian Methods and Mixture Models · Statistical Methods and Inference · Statistical Methods and Bayesian Inference
