Uniform Generalization Bound on Time and Inverse Temperature for Gradient Descent Algorithm and its Application to Analysis of Simulated Annealing
Keisuke Suzuki

TL;DR
This paper establishes a novel uniform generalization bound for SGLD that is independent of time and inverse temperature, and applies it to analyze the effectiveness of simulated annealing in non-convex optimization.
Contribution
It introduces a Rademacher complexity-based generalization bound that removes dependence on time and inverse temperature, advancing theoretical understanding of SGLD and simulated annealing.
Findings
Generalization bound independent of time and inverse temperature
Evaluation of simulated annealing effectiveness in non-convex settings
Sample size and time evaluations with specific convergence rates
Abstract
In this paper, we propose a novel uniform generalization bound on the time and inverse temperature for stochastic gradient Langevin dynamics (SGLD) in a non-convex setting. While previous works derive their generalization bounds by uniform stability, we use Rademacher complexity to make our generalization bound independent of the time and inverse temperature. Using Rademacher complexity, we can reduce the problem to derive a generalization bound on the whole space to that on a bounded region and therefore can remove the effect of the time and inverse temperature from our generalization bound. As an application of our generalization bound, an evaluation on the effectiveness of the simulated annealing in a non-convex setting is also described. For the sample size and time , we derive evaluations with orders and , respectively. Here,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMarkov Chains and Monte Carlo Methods · Topological and Geometric Data Analysis · Statistical Methods and Inference
