Randomized dual proximal gradient for large-scale distributed   optimization

Ivano Notarnicola; Giuseppe Notarstefano

arXiv:1609.05713·math.OC·September 20, 2016

Randomized dual proximal gradient for large-scale distributed optimization

Ivano Notarnicola, Giuseppe Notarstefano

PDF

TL;DR

This paper introduces a randomized dual proximal gradient method for large-scale distributed optimization, enabling asynchronous updates over networks with separable, possibly non-smooth, convex functions.

Contribution

It proposes a novel asynchronous distributed algorithm based on randomized block-coordinate proximal gradient on the dual problem, handling non-smooth separable functions.

Findings

01

Algorithm converges under specified conditions.

02

Enables efficient distributed optimization over undirected networks.

03

Handles non-smooth regularization and constraints.

Abstract

In this paper we consider distributed optimization problems in which the cost function is separable (i.e., a sum of possibly non-smooth functions all sharing a common variable) and can be split into a strongly convex term and a convex one. The second term is typically used to encode constraints or to regularize the solution. We propose an asynchronous, distributed optimization algorithm over an undirected topology, based on a proximal gradient update on the dual problem. We show that by means of a proper choice of primal variables, the dual problem is separable and the dual variables can be stacked into separate blocks. This allows us to show that a distributed gossip update can be obtained by means of a randomized block-coordinate proximal gradient on the dual function.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.