Classification of Data Generated by Gaussian Mixture Models Using Deep   ReLU Networks

Tian-Yi Zhou; Xiaoming Huo

arXiv:2308.08030·stat.ML·August 17, 2023·1 cites

Classification of Data Generated by Gaussian Mixture Models Using Deep ReLU Networks

Tian-Yi Zhou, Xiaoming Huo

PDF

Open Access

TL;DR

This paper provides non-asymptotic bounds and convergence rates for deep ReLU networks classifying data from Gaussian Mixture Models, showing they can overcome the curse of dimensionality in unbounded domains.

Contribution

It introduces the first non-asymptotic analysis of deep ReLU networks for GMM data, with bounds independent of dimension, and develops a novel approximation error bound for analytic functions.

Findings

01

Deep ReLU networks achieve dimension-independent convergence rates.

02

The analysis applies to unbounded domains using Gaussian distribution properties.

03

Results support the practical efficiency of deep neural networks in classification tasks.

Abstract

This paper studies the binary classification of unbounded data from $R^{d}$ generated under Gaussian Mixture Models (GMMs) using deep ReLU neural networks. We obtain $\unicode x 2013$ for the first time $\unicode x 2013$ non-asymptotic upper bounds and convergence rates of the excess risk (excess misclassification error) for the classification without restrictions on model parameters. The convergence rates we derive do not depend on dimension $d$ , demonstrating that deep ReLU networks can overcome the curse of dimensionality in classification. While the majority of existing generalization analysis of classification algorithms relies on a bounded domain, we consider an unbounded domain by leveraging the analyticity and fast decay of Gaussian distributions. To facilitate our analysis, we give a novel approximation error bound for general analytic functions using ReLU networks,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Algorithms · Bayesian Methods and Mixture Models · Gaussian Processes and Bayesian Inference