Robust approach for variable selection with high dimensional Logitudinal data analysis
Liya Fu, Jiaqi Li, You-Gan Wang

TL;DR
This paper introduces a robust variable selection method for high-dimensional longitudinal data that effectively handles outliers and correlation structures, demonstrating strong performance in simulations and real data applications.
Contribution
It presents a novel robust smooth-threshold estimating equation with a new correlation matrix, applicable when covariates grow with sample size, and establishes its oracle properties.
Findings
Method performs well with contaminated data
Competitive with true correlation estimates
Effective in high-dimensional settings
Abstract
This paper proposes a new robust smooth-threshold estimating equation to select important variables and automatically estimate parameters for high dimensional longitudinal data. A novel working correlation matrix is proposed to capture correlations within the same subject. The proposed procedure works well when the number of covariates p increases as the number of subjects n increases. The proposed estimates are competitive with the estimates obtained with the true correlation structure, especially when the data are contaminated. Moreover, the proposed method is robust against outliers in the response variables and/or covariates. Furthermore, the oracle properties for robust smooth-threshold estimating equations under "large n, diverging p" are established under some regularity conditions. Extensive simulation studies and a yeast cell cycle data are used to evaluate the performance of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Inference · Advanced Statistical Methods and Models · Statistical Methods and Bayesian Inference
