Robust rank correlation based screening
Gaorong Li, Heng Peng, Jun Zhang, Lixing Zhu

TL;DR
This paper introduces a robust rank correlation screening method for ultra-high dimensional data that is resilient to outliers and applicable to semiparametric models, with theoretical guarantees and practical validation.
Contribution
It proposes a novel RRCS method based on Kendall tau correlation, extending variable screening to more robust and flexible models with weaker assumptions.
Findings
Effective in ultra-high dimensions with exponential growth of predictors
Robust against outliers and influence points
Performs well in simulations and real data analysis
Abstract
Independence screening is a variable selection method that uses a ranking criterion to select significant variables, particularly for statistical models with nonpolynomial dimensionality or "large p, small n" paradigms when p can be as large as an exponential of the sample size n. In this paper we propose a robust rank correlation screening (RRCS) method to deal with ultra-high dimensional data. The new procedure is based on the Kendall \tau correlation coefficient between response and predictor variables rather than the Pearson correlation of existing methods. The new method has four desirable features compared with existing independence screening methods. First, the sure independence screening property can hold only under the existence of a second order moment of predictor variables, rather than exponential tails or alikeness, even when the number of predictor variables grows as fast…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
