Phase transitions in optimal unsupervised learning

Arnaud Buhot; Mirta B. Gordon

arXiv:cond-mat/9709274·cond-mat.dis-nn·October 30, 2009

Phase transitions in optimal unsupervised learning

Arnaud Buhot, Mirta B. Gordon

PDF

TL;DR

This paper analyzes the optimal performance and phase transitions in an unsupervised learning problem involving high-dimensional data with symmetry-breaking features, revealing conditions for first-order transitions and metastable states.

Contribution

It provides a theoretical analysis of the phase transitions and metastable states in optimal unsupervised learning of symmetry axes in high-dimensional data.

Findings

01

Identification of first-order phase transitions in learning curves

02

Existence of metastable high-performance states near transitions

03

Conditions under which optimal solutions are not learnable in Bayesian setting

Abstract

We determine the optimal performance of learning the orientation of the symmetry axis of a set of P = alpha N points that are uniformly distributed in all the directions but one on the N-dimensional sphere. The components along the symmetry breaking direction, of unitary vector B, are sampled from a mixture of two gaussians of variable separation and width. The typical optimal performance is measured through the overlap Ropt=B.J* where J* is the optimal guess of the symmetry breaking direction. Within this general scenario, the learning curves Ropt(alpha) may present first order transitions if the clusters are narrow enough. Close to these transitions, high performance states can be obtained through the minimization of the corresponding optimal potential, although these solutions are metastable, and therefore not learnable, within the usual bayesian scenario.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.