An Empirical Bayes Approach for High Dimensional Classification

Yunbo Ouyang; Feng Liang

arXiv:1702.05056 · stat.ML · February 17, 2017 · 2 cites

TL;DR

This paper introduces an empirical Bayes method using Dirichlet process mixtures for high-dimensional classification, providing theoretical error bounds and an efficient variational Bayes algorithm suitable for ultra-high dimensions.

Contribution

It presents a novel empirical Bayes estimator for sparse mean differences and establishes theoretical links between estimation and classification errors.

Findings

01 The method achieves competitive classification accuracy.

02 Theoretical conditions for optimal and sub-optimal classifiers are provided.

03 An efficient parallelizable algorithm is developed for ultra-high dimensional data.

Abstract

We propose an empirical Bayes estimator, based on a Dirichlet process mixture model, for estimating the sparse normalized mean difference; the estimator can be applied directly to high-dimensional linear classification. In theory, we build a bridge connecting the estimation error of the mean difference to the misclassification error, and we provide sufficient conditions for sub-optimal and optimal classifiers. In implementation, we develop a variational Bayes algorithm that computes the posterior efficiently and can be parallelized to handle the ultra-high-dimensional case.
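To make the link between a sparse mean-difference estimate and a linear classification rule concrete, here is a minimal Python sketch. It is not the authors' method: it assumes a diagonal (independence) covariance, and it swaps in plain hard thresholding as a stand-in for the Dirichlet process mixture estimator described in the abstract. All function names and the threshold `t` are hypothetical and chosen for illustration only.

```python
import numpy as np

def normalized_mean_difference(X0, X1):
    """Per-feature mean difference scaled by the pooled standard deviation."""
    n0, n1 = len(X0), len(X1)
    pooled_var = ((n0 - 1) * X0.var(axis=0, ddof=1)
                  + (n1 - 1) * X1.var(axis=0, ddof=1)) / (n0 + n1 - 2)
    sd = np.sqrt(pooled_var)
    d = (X1.mean(axis=0) - X0.mean(axis=0)) / sd
    return d, sd

def hard_threshold(d, t):
    """Stand-in sparse estimator (assumption): zero out entries with |d| <= t."""
    return np.where(np.abs(d) > t, d, 0.0)

def fit_linear_rule(X0, X1, t):
    """Build a linear rule from the sparse normalized mean difference."""
    d, sd = normalized_mean_difference(X0, X1)
    d_sparse = hard_threshold(d, t)
    midpoint = (X0.mean(axis=0) + X1.mean(axis=0)) / 2.0
    w = d_sparse / sd  # weights back on the original feature scale
    return w, midpoint

def predict(X, w, midpoint):
    """Assign class 1 when the linear score is positive, else class 0."""
    return ((X - midpoint) @ w > 0).astype(int)

# Small synthetic demo: only the first 5 of 200 features carry signal.
rng = np.random.default_rng(0)
p, n = 200, 50
shift = np.zeros(p)
shift[:5] = 2.0
X0 = rng.normal(size=(n, p))
X1 = rng.normal(size=(n, p)) + shift
w, midpoint = fit_linear_rule(X0, X1, t=0.65)
print("selected features:", np.flatnonzero(w))
```

In this sketch the quality of the classifier depends entirely on how well the sparse vector is estimated, which is the connection between estimation error and misclassification error that the abstract refers to. Replacing `hard_threshold` with a better sparse estimator leaves the rest of the rule unchanged.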

Code & Models

Repositories

yunboouyang/EBclassifier (Official)

Taxonomy

Topics: Bayesian Methods and Mixture Models · Gene expression and cancer classification · Statistical Methods and Inference