Permutative redundancy and uncertainty of the objective in deep learning

Vacslav Glukhov

arXiv:2411.07008·cs.AI·November 12, 2024

Permutative redundancy and uncertainty of the objective in deep learning

Vacslav Glukhov

PDF

Open Access

TL;DR

This paper discusses how the symmetry and uncertainty in deep learning objectives create numerous equivalent optima, complicating optimization, and explores potential remedies like pruning, reordering, and bio-inspired architectures.

Contribution

It highlights the impact of permutative symmetry and objective uncertainty on optimization landscapes and proposes methods to mitigate ghost optima in deep learning models.

Findings

01

Traditional architectures have many equivalent global and local optima.

02

Uncertainty in objectives prevents local optima from being reached.

03

Proposed remedies can reduce or eliminate ghost optima.

Abstract

Implications of uncertain objective functions and permutative symmetry of traditional deep learning architectures are discussed. It is shown that traditional architectures are polluted by an astronomical number of equivalent global and local optima. Uncertainty of the objective makes local optima unattainable, and, as the size of the network grows, the global optimization landscape likely becomes a tangled web of valleys and ridges. Some remedies which reduce or eliminate ghost optima are discussed including forced pre-pruning, re-ordering, ortho-polynomial activations, and modular bio-inspired architectures.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Research in Systems and Signal Processing · Advanced Data Processing Techniques