An Empirical Bayes Approach for High Dimensional Classification
Yunbo Ouyang, Feng Liang

TL;DR
This paper introduces an empirical Bayes method using Dirichlet process mixtures for high-dimensional classification, providing theoretical error bounds and an efficient variational Bayes algorithm suitable for ultra-high dimensions.
Contribution
It presents a novel empirical Bayes estimator for sparse mean differences and establishes theoretical links between estimation and classification errors.
Findings
The method achieves competitive classification accuracy.
Theoretical conditions for optimal and sub-optimal classifiers are provided.
An efficient parallelizable algorithm is developed for ultra-high dimensional data.
Abstract
We propose an empirical Bayes estimator based on Dirichlet process mixture model for estimating the sparse normalized mean difference, which could be directly applied to the high dimensional linear classification. In theory, we build a bridge to connect the estimation error of the mean difference and the misclassification error, also provide sufficient conditions of sub-optimal classifiers and optimal classifiers. In implementation, a variational Bayes algorithm is developed to compute the posterior efficiently and could be parallelized to deal with the ultra-high dimensional case.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBayesian Methods and Mixture Models · Gene expression and cancer classification · Statistical Methods and Inference
