TL;DR
This paper presents an online deterministic annealing algorithm for classification and clustering that reduces hyper-parameter tuning, avoids poor local minima, and adaptively increases model complexity through an annealing process.
Contribution
It introduces a novel online gradient-free annealing algorithm for prototype-based learning that enhances robustness, interpretability, and online control over complexity.
Findings
Reduces sensitivity to initial conditions.
Provides a natural framework for dissimilarity measures.
Enables online adjustment of model complexity.
Abstract
Inherent in virtually every iterative machine learning algorithm is the problem of hyper-parameter tuning, which includes three major design parameters: (a) the complexity of the model, e.g., the number of neurons in a neural network, (b) the initial conditions, which heavily affect the behavior of the algorithm, and (c) the dissimilarity measure used to quantify its performance. We introduce an online prototype-based learning algorithm that can be viewed as a progressively growing competitive-learning neural network architecture for classification and clustering. The learning rule of the proposed approach is formulated as an online gradient-free stochastic approximation algorithm that solves a sequence of appropriately defined optimization problems, simulating an annealing process. The annealing nature of the algorithm contributes to avoiding poor local minima, offers robustness with…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
