Gradient-free optimization via integration
Christophe Andrieu, Nicolas Chopin, Ettore Fincato, Mathieu Gerber

TL;DR
This paper introduces a gradient-free optimization method that fits functions to distributions using Bayesian updates and Monte Carlo, enabling convergence analysis and demonstrating effectiveness on complex machine learning tasks.
Contribution
The paper presents a novel model-based search algorithm that optimizes non-convex, non-differentiable functions through distribution fitting and implicit gradient descent, with convergence guarantees.
Findings
Algorithm effectively optimizes complex functions.
Convergence analysis for inhomogeneous gradient descent.
Successful application to a challenging classification task.
Abstract
We develop and analyse an approach to optimize functions not assumed to be convex, differentiable or even continuous. The algorithm belongs to the class of model-based search methods. The idea is to fit recursively to a parametric family of distributions, using a Bayesian update followed by a reprojection back onto the chosen family. Remarkably, reprojection in our scenario boils down to computing expectations, which can be simply approximated through Monte Carlo. We show that when the family of distributions is appropriately chosen this approach can be interpreted as an implicit time-inhomogeneous gradient descent algorithm on a sequence of smoothed approximations of , providing a route to establishing convergence. We establish new results for generic inhomogeneous gradient descent algorithms, which we specialise to the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Advanced Optimization Algorithms Research
