Message Passing Descent for Efficient Machine Learning
Francesco Concetti, Michael Chertkov

TL;DR
This paper introduces Message Passing Descent, a novel optimization algorithm for machine learning that uses graphical models and non-local updates to improve over traditional gradient descent, especially in complex landscapes.
Contribution
The paper presents a new message passing descent algorithm that leverages graphical models and non-local parameter updates for more effective optimization in neural network training.
Findings
MPD outperforms gradient descent in experiments.
MPD avoids local minima due to non-local updates.
Algorithm demonstrated on neural networks with piece-wise-linear activations.
Abstract
We propose a new iterative optimization method for the {\bf Data-Fitting} (DF) problem in Machine Learning, e.g. Neural Network (NN) training. The approach relies on {\bf Graphical Model} (GM) representation of the DF problem, where variables are fitting parameters and factors are associated with the Input-Output (IO) data. The GM results in the {\bf Belief Propagation} Equations considered in the {\bf Large Deviation Limit} corresponding to the practically important case when the number of the IO samples is much larger than the number of the fitting parameters. We suggest the {\bf Message Passage Descent} algorithm which relies on the piece-wise-polynomial representation of the model DF function. In contrast with the popular gradient descent and related algorithms our MPD algorithm rely on analytic (not automatic) differentiation, while also (and most importantly) it descents through…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Advanced Neural Network Applications · Domain Adaptation and Few-Shot Learning
