Conditional Selective Inference for the Selected Groups in Panel Data
Chuang Wan, Jiajun Sun, Xingbai Xu

TL;DR
This paper introduces a new selective inference method for testing differences in group-specific slopes in panel data after clustering, addressing the bias caused by using the same data for clustering and testing.
Contribution
It proposes a valid conditional inference approach that accounts for the selection process in clustering, extending to covariate differences and GMM frameworks.
Findings
Method shows good finite sample performance in simulations.
Applied to economic growth and CO2 emissions, revealing new insights.
Provides an R package for implementation.
Abstract
We consider the problem of testing for differences in group-specific slopes between the selected groups in panel data identified via k-means clustering. In this setting, the classical Wald-type test statistic is problematic because it produces an extremely inflated type I error probability. The underlying reason is that the same dataset is used to identify the group structure and construct the test statistic, simultaneously. This creates dependence between the selection and inference stages. To address this issue, we propose a valid selective inference approach conditional on the selection event to account for the selection effect. We formally define the selective type I error and describe how to efficiently compute the correct p-values for clusters obtained using k-means clustering. Furthermore, the same idea can be extended to test for differences in coefficients due to a single…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpatial and Panel Data Analysis · Statistical Methods and Inference · Income, Poverty, and Inequality
