Conditional Selective Inference for the Selected Groups in Panel Data

Chuang Wan; Jiajun Sun; Xingbai Xu

arXiv:2511.04466·stat.ME·November 7, 2025

Conditional Selective Inference for the Selected Groups in Panel Data

Chuang Wan, Jiajun Sun, Xingbai Xu

PDF

Open Access

TL;DR

This paper introduces a new selective inference method for testing differences in group-specific slopes in panel data after clustering, addressing the bias caused by using the same data for clustering and testing.

Contribution

It proposes a valid conditional inference approach that accounts for the selection process in clustering, extending to covariate differences and GMM frameworks.

Findings

01

Method shows good finite sample performance in simulations.

02

Applied to economic growth and CO2 emissions, revealing new insights.

03

Provides an R package for implementation.

Abstract

We consider the problem of testing for differences in group-specific slopes between the selected groups in panel data identified via k-means clustering. In this setting, the classical Wald-type test statistic is problematic because it produces an extremely inflated type I error probability. The underlying reason is that the same dataset is used to identify the group structure and construct the test statistic, simultaneously. This creates dependence between the selection and inference stages. To address this issue, we propose a valid selective inference approach conditional on the selection event to account for the selection effect. We formally define the selective type I error and describe how to efficiently compute the correct p-values for clusters obtained using k-means clustering. Furthermore, the same idea can be extended to test for differences in coefficients due to a single…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpatial and Panel Data Analysis · Statistical Methods and Inference · Income, Poverty, and Inequality