Uniform-over-dimension location tests for multivariate and high-dimensional data
Ritabrata Karmakar, Joydeep Chowdhury, Subhajit Dutta, Marc G. Genton

TL;DR
This paper introduces a new hypothesis testing method for multivariate and high-dimensional data that remains valid across different dimensions, outperforming traditional tests like Hotelling's T-squared in various scenarios.
Contribution
The authors develop a uniform-over-dimension central limit theorem and an associated test for two-sample location equality that works effectively regardless of the data dimension.
Findings
The proposed test performs better than existing high-dimensional tests in simulations.
It outperforms Hotelling's T-squared test in real data applications.
The method is valid for a wide range of data dimensions without specific asymptotic conditions.
Abstract
Asymptotic methods for hypothesis testing in high-dimensional data usually require the dimension of the observations to increase to infinity, often with an additional relationship between the dimension (say, ) and the sample size (say, ). On the other hand, multivariate asymptotic testing methods are valid for fixed dimension only and their implementations typically require the sample size to be large compared to the dimension to yield desirable results. In practical scenarios, it is usually not possible to determine whether the dimension of the data conform to the conditions required for the validity of the high-dimensional asymptotic methods for hypothesis testing, or whether the sample size is large enough compared to the dimension of the data. In this work, we first describe the notion of uniform-over- convergences and subsequently, develop a uniform-over-dimension central…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Inference · Random Matrices and Applications · Advanced Statistical Methods and Models
