Partial Correlation Screening for Estimating Large Precision Matrices, with Applications to Classification
Shiqiong Huang, Jiashun Jin, Zhigang Yao

TL;DR
This paper introduces Partial Correlation Screening (PCS), an efficient method for estimating large precision matrices, and demonstrates its effectiveness in classification tasks, outperforming existing methods like glasso and competing with SVM and RF.
Contribution
PCS is a novel, computationally efficient approach for large precision matrix estimation that enables effective classification, with theoretical guarantees and practical superiority over existing methods.
Findings
PCS accurately recovers the support of the precision matrix.
HCT-PCS achieves optimal classification performance.
HCT-PCS outperforms HCT-glasso and is competitive with SVM and RF.
Abstract
We propose Partial Correlation Screening (PCS) as a new row-by-row approach to estimating a large precision matrix . To estimate the -th row of , , PCS uses a Screen step and a Clean step. In the Screen step, PCS recruits a (small) subset of indices using a stage-wise algorithm, where in each stage, the algorithm updates the set of recruited indices by adding the index that has the largest (in magnitude) empirical partial correlation with . In the Clean step, PCS re-investigates all recruited indices and use them to reconstruct the -th row of . PCS is computationally efficient and modest in memory use: to estimate a row of , it only needs a few rows (determined sequentially) of the empirical covariance matrix. This enables PCS to execute the estimation of a large precision matrix (e.g., ) in a few minutes, and open…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGene expression and cancer classification · Statistical Methods and Inference · Bioinformatics and Genomic Networks
