High Dimensional Statistical Inference and Random Matrices
Iain M. Johnstone

TL;DR
This paper reviews the development of multivariate statistical analysis and its connection with random matrix theory (RMT), emphasizing how RMT offers valuable tools for high-dimensional data analysis.
Contribution
It highlights the historical development and recent progress in applying RMT to high-dimensional multivariate statistical inference.
Findings
RMT provides new insights into high-dimensional data structures
Connections between multivariate statistics and RMT have been increasingly exploited
Recent advances improve understanding of complex data in genetics and social sciences
Abstract
Multivariate statistical analysis is concerned with observations on several variables which are thought to possess some degree of inter-dependence. Driven by problems in genetics and the social sciences, it first flowered in the earlier half of the last century. Subsequently, random matrix theory (RMT) developed, initially within physics, and more recently widely in mathematics. While some of the central objects of study in RMT are identical to those of multivariate statistics, statistical theory was slow to exploit the connection. However, with vast data collection ever more common, data sets now often have as many or more variables than the number of individuals observed. In such contexts, the techniques and results of RMT have much to offer multivariate statistics. The paper reviews some of the progress to date.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRandom Matrices and Applications · Advanced Combinatorial Mathematics · Stochastic processes and statistical mechanics
