On perturbed proximal gradient algorithms

Yves F. Atchade; Gersende Fort; Eric Moulines

arXiv:1402.2365·math.ST·November 22, 2016·J. Mach. Learn. Res.·50 cites

On perturbed proximal gradient algorithms

Yves F. Atchade, Gersende Fort, Eric Moulines

PDF

Open Access

TL;DR

This paper analyzes a perturbed proximal gradient algorithm that uses Monte Carlo methods for gradient approximation, providing convergence guarantees and non-asymptotic bounds for both biased and unbiased cases.

Contribution

It introduces convergence conditions and bounds for a Monte Carlo-based proximal gradient method, covering both biased and unbiased gradient approximations.

Findings

01

Convergence is guaranteed under specific step size and batch size conditions.

02

Non-asymptotic bounds are derived for the averaged algorithm.

03

Applications include sparse generalized linear models and graphical model learning.

Abstract

We study a version of the proximal gradient algorithm for which the gradient is intractable and is approximated by Monte Carlo methods (and in particular Markov Chain Monte Carlo). We derive conditions on the step size and the Monte Carlo batch size under which convergence is guaranteed: both increasing batch size and constant batch size are considered. We also derive non-asymptotic bounds for an averaged version. Our results cover both the cases of biased and unbiased Monte Carlo approximation. To support our findings, we discuss the inference of a sparse generalized linear model with random effect and the problem of learning the edge structure and parameters of sparse undirected graphical models.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques · Numerical methods in inverse problems