Stochastic Langevin Differential Inclusions with Applications to Machine Learning
Fabio V. Difonzo, Vyacheslav Kungurtsev, Jakub Marecek

TL;DR
This paper extends the theoretical understanding of Langevin-type stochastic differential inclusions, crucial for machine learning, by establishing foundational results including solution existence and asymptotic behavior under non-smooth potential functions.
Contribution
It introduces foundational results for Langevin-type stochastic differential inclusions with non-smooth potentials, relevant to machine learning applications.
Findings
Proves strong existence of solutions for Langevin-type stochastic differential inclusions.
Demonstrates asymptotic minimization of the free-energy functional.
Addresses non-Lipschitz drift scenarios in machine learning contexts.
Abstract
Stochastic differential equations of Langevin-diffusion form have received significant attention, thanks to their foundational role in both Bayesian sampling algorithms and optimization in machine learning. In the latter, they serve as a conceptual model of the stochastic gradient flow in training over-parameterized models. However, the literature typically assumes smoothness of the potential, whose gradient is the drift term. Nevertheless, there are many problems for which the potential function is not continuously differentiable, and hence the drift is not Lipschitz continuous everywhere. This is exemplified by robust losses and Rectified Linear Units in regression problems. In this paper, we show some foundational results regarding the flow and asymptotic properties of Langevin-type Stochastic Differential Inclusions under assumptions appropriate to the machine-learning settings. In…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMarkov Chains and Monte Carlo Methods · Model Reduction and Neural Networks · Gaussian Processes and Bayesian Inference
