Nonparametric independence screening and structure identification for ultra-high dimensional longitudinal data
Ming-Yen Cheng, Toshio Honda, Jialiang Li, Heng Peng

TL;DR
This paper introduces a new efficient method for variable screening and structure identification in ultra-high dimensional longitudinal data, improving accuracy and computational efficiency over existing approaches.
Contribution
It proposes a novel two-step procedure combining screening and semivarying coefficient modeling with theoretical guarantees and practical implementation strategies.
Findings
High probability of correctly screening irrelevant variables
Method achieves desirable sparsity and oracle properties
Demonstrated effective performance on simulated and real data
Abstract
Ultra-high dimensional longitudinal data are increasingly common and the analysis is challenging both theoretically and methodologically. We offer a new automatic procedure for finding a sparse semivarying coefficient model, which is widely accepted for longitudinal data analysis. Our proposed method first reduces the number of covariates to a moderate order by employing a screening procedure, and then identifies both the varying and constant coefficients using a group SCAD estimator, which is subsequently refined by accounting for the within-subject correlation. The screening procedure is based on working independence and B-spline marginal models. Under weaker conditions than those in the literature, we show that with high probability only irrelevant variables will be screened out, and the number of selected variables can be bounded by a moderate order. This allows the desirable…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
