Non-local Optimization: Imposing Structure on Optimization Problems by Relaxation
Nils M\"uller, Tobias Glasmachers

TL;DR
This paper explores the structure of relaxations in stochastic optimization using measure theory and Fourier analysis, revealing properties like Lipschitzness and convexity that facilitate efficient optimization, especially when the original function is non-differentiable or non-convex.
Contribution
It provides a theoretical framework for understanding the structure of relaxations in stochastic optimization, highlighting conditions for effective optimization.
Findings
Optimal relaxation values are consistent with the original function.
Gradients of relaxations are Lipschitz continuous.
Convexity properties enable reliable optimization even with non-differentiable functions.
Abstract
In stochastic optimization, particularly in evolutionary computation and reinforcement learning, the optimization of a function is often addressed through optimizing a so-called relaxation of , where resembles the parameters of a family of probability measures on . We investigate the structure of such relaxations by means of measure theory and Fourier analysis, enabling us to shed light on the success of many associated stochastic optimization methods. The main structural traits we derive and that allow fast and reliable optimization of relaxations are the consistency of optimal values of , Lipschitzness of gradients, and convexity. We emphasize settings where itself is not differentiable or convex, e.g., in the presence of (stochastic) disturbance.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques · Advanced Bandit Algorithms Research
