Long-Tail Theory under Gaussian Mixtures

Arman Bolatov; Maxat Tezekbayev; Igor Melnykov; Artur Pak; Vassilina; Nikoulina; Zhenisbek Assylbekov

arXiv:2307.10736·cs.LG·July 26, 2023·1 cites

Long-Tail Theory under Gaussian Mixtures

Arman Bolatov, Maxat Tezekbayev, Igor Melnykov, Artur Pak, Vassilina, Nikoulina, Zhenisbek Assylbekov

PDF

Open Access 1 Repo

TL;DR

This paper models long-tail data using Gaussian mixtures and shows that nonlinear classifiers outperform linear ones in generalization, especially with longer tails, highlighting the importance of rare examples.

Contribution

It introduces a Gaussian mixture model aligned with Feldman's long tail theory and demonstrates the impact of classifier type and tail length on generalization performance.

Findings

01

Nonlinear classifiers can surpass linear classifiers in long-tail settings.

02

The performance gap decreases as the tail shortens.

03

Experimental validation on synthetic and real data supports the theory.

Abstract

We suggest a simple Gaussian mixture model for data generation that complies with Feldman's long tail theory (2020). We demonstrate that a linear classifier cannot decrease the generalization error below a certain level in the proposed model, whereas a nonlinear classifier with a memorization capacity can. This confirms that for long-tailed distributions, rare training examples must be considered for optimal generalization to new data. Finally, we show that the performance gap between linear and nonlinear models can be lessened as the tail becomes shorter in the subpopulation frequency distribution, as confirmed by experiments on synthetic and real data.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

armanbolatov/long_tail
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGaussian Processes and Bayesian Inference · Bayesian Methods and Mixture Models · Target Tracking and Data Fusion in Sensor Networks