Consistent Selection of the Number of Groups in Panel Models via Cross-Validation
Zhe Li, Xuening Zhu, Changliang Zou

TL;DR
This paper introduces a data-driven cross-validation method for accurately selecting the number of groups in panel data models, ensuring consistency and broad applicability across various models.
Contribution
The paper proposes a novel CV approach for group number selection that is fully data-driven, tuning-free, and adaptable to diverse panel data models.
Findings
The CV method accurately estimates the number of groups in synthetic datasets.
The method demonstrates consistency under theoretical conditions.
Application to Chinese stock data reveals heterogeneous volatility patterns.
Abstract
Group number selection is a key problem for group panel data modeling. In this work, we develop a cross-validation (CV) method to tackle this problem. Specifically, we split the panel data into two data folds on the time span, with group structure preserved for individuals. We first estimate the group memberships and parameters on one data fold, then we plug in the estimates and utilize the other data fold to evaluate a designed criterion. Subsequently, the group number is estimated by minimizing the average criterion across all data folds. The proposed CV method has two advantages compared to existing approaches. First, the method is totally data-driven, thus no further tuning parameters are involved. Second, the method can be flexibly applied to a wide range of panel data models. Theoretically, we establish the estimation consistency by taking advantage of the optimization property of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpatial and Panel Data Analysis · demographic modeling and climate adaptation · Human Mobility and Location-Based Analysis
