Doubly Stochastic Primal-Dual Coordinate Method for Bilinear   Saddle-Point Problem

Adams Wei Yu; Qihang Lin; Tianbao Yang

arXiv:1508.03390·cs.LG·April 13, 2017·2 cites

Doubly Stochastic Primal-Dual Coordinate Method for Bilinear Saddle-Point Problem

Adams Wei Yu, Qihang Lin, Tianbao Yang

PDF

Open Access

TL;DR

This paper introduces a doubly stochastic primal-dual coordinate method for solving bilinear saddle-point problems in empirical risk minimization, demonstrating faster convergence and efficiency improvements over existing methods.

Contribution

The paper presents a novel doubly stochastic primal-dual coordinate algorithm with proven linear convergence for bilinear saddle-point problems, especially effective with structured data or costly proximal mappings.

Findings

01

Lower overall complexity compared to existing methods

02

Proven linear convergence in terms of distance and objective gap

03

Empirical validation on multi-task large margin nearest neighbor problem

Abstract

We propose a doubly stochastic primal-dual coordinate optimization algorithm for empirical risk minimization, which can be formulated as a bilinear saddle-point problem. In each iteration, our method randomly samples a block of coordinates of the primal and dual solutions to update. The linear convergence of our method could be established in terms of 1) the distance from the current iterate to the optimal solution and 2) the primal-dual objective gap. We show that the proposed method has a lower overall complexity than existing coordinate methods when either the data matrix has a factorized structure or the proximal mapping on each block is computationally expensive, e.g., involving an eigenvalue decomposition. The efficiency of the proposed method is confirmed by empirical studies on several real applications, such as the multi-task large margin nearest neighbor problem.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques · Markov Chains and Monte Carlo Methods