# Diagonally-Dominant Principal Component Analysis

**Authors:** Zheng Tracy Ke, Lingzhou Xue, Fan Yang

arXiv: 1906.00051 · 2019-06-04

## TL;DR

This paper introduces Diagonally-Dominant PCA (DD-PCA), a novel matrix decomposition method that enhances covariance estimation and multiple testing procedures for strongly correlated data, with demonstrated practical benefits.

## Contribution

The paper develops DD-PCA, a new decomposition technique, and applies it to improve covariance estimation and global null hypothesis testing in correlated data scenarios.

## Key findings

- DD-PCA provides accurate covariance and precision matrix estimation.
- New tests outperform existing methods in multiple testing scenarios.
- Algorithms for efficient DD-PCA computation are proposed.

## Abstract

We consider the problem of decomposing a large covariance matrix into the sum of a low-rank matrix and a diagonally dominant matrix, and we call this problem the "Diagonally-Dominant Principal Component Analysis (DD-PCA)". DD-PCA is an effective tool for designing statistical methods for strongly correlated data. We showcase the use of DD-PCA in two statistical problems: covariance matrix estimation, and global detection in multiple testing. Using the output of DD-PCA, we propose a new estimator for estimating a large covariance matrix with factor structure. Thanks to a nice property of diagonally dominant matrices, this estimator enjoys the advantage of simultaneous good estimation of the covariance matrix and the precision matrix (by a plain inversion). A plug-in of this estimator to linear discriminant analysis and portfolio optimization yields appealing performance in real data. We also propose two new tests for testing the global null hypothesis in multiple testing when the $z$-scores have a factor covariance structure. Both tests first use DD-PCA to adjust the individual $p$-values and then plug in the adjusted $p$-values to the Higher Criticism (HC) test. These new tests significantly improve over the HC test and compare favorably with other existing tests. For computation of DD-PCA, we propose an iterative projection algorithm and an ADMM algorithm.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1906.00051/full.md

## Figures

24 figures with captions in the complete paper: https://tomesphere.com/paper/1906.00051/full.md

## References

53 references — full list in the complete paper: https://tomesphere.com/paper/1906.00051/full.md

---
Source: https://tomesphere.com/paper/1906.00051