Bayesian Optimal Two-sample Tests in High-dimension

Kyoungjae Lee; Kisung You; Lizhen Lin

arXiv:2112.02580·stat.ME·December 7, 2021

Bayesian Optimal Two-sample Tests in High-dimension

Kyoungjae Lee, Kisung You, Lizhen Lin

PDF

Open Access

TL;DR

This paper introduces Bayesian two-sample tests designed for high-dimensional data, which are more powerful and scalable, especially when differences are sparse, outperforming existing methods in simulations and real gene expression data analysis.

Contribution

The paper develops novel Bayesian two-sample tests with closed-form Bayes factors that are optimal, scalable, and outperform existing methods in high-dimensional, sparse difference scenarios.

Findings

01

Proposed tests are consistent under mild conditions.

02

Tests outperform state-of-the-art methods in simulations.

03

Applied successfully to gene expression data analysis.

Abstract

We propose optimal Bayesian two-sample tests for testing equality of high-dimensional mean vectors and covariance matrices between two populations. In many applications including genomics and medical imaging, it is natural to assume that only a few entries of two mean vectors or covariance matrices are different. Many existing tests that rely on aggregating the difference between empirical means or covariance matrices are not optimal or yield low power under such setups. Motivated by this, we develop Bayesian two-sample tests employing a divide-and-conquer idea, which is powerful especially when the difference between two populations is sparse but large. The proposed two-sample tests manifest closed forms of Bayes factors and allow scalable computations even in high-dimensions. We prove that the proposed tests are consistent under relatively mild conditions compared to existing tests in…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStatistical Methods and Inference · Bayesian Methods and Mixture Models · Statistical Methods and Bayesian Inference