A Distributed Frank-Wolfe Framework for Learning Low-Rank Matrices with   the Trace Norm

Wenjie Zheng; Aur\'elien Bellet; Patrick Gallinari

arXiv:1712.07495·cs.DC·May 15, 2018

A Distributed Frank-Wolfe Framework for Learning Low-Rank Matrices with the Trace Norm

Wenjie Zheng, Aur\'elien Bellet, Patrick Gallinari

PDF

1 Repo

TL;DR

This paper introduces DFW-Trace, a distributed Frank-Wolfe algorithm that efficiently learns low-rank matrices from large-scale, distributed datasets by exploiting trace norm constraints and low-rank structures, with theoretical guarantees and practical validation.

Contribution

The paper presents DFW-Trace, a novel distributed Frank-Wolfe algorithm that leverages low-rank structure for efficient large-scale matrix learning with theoretical convergence analysis.

Findings

01

Achieves sublinear convergence with few power iterations

02

Efficient in time, memory, and communication in distributed settings

03

Validated on synthetic and real datasets including ImageNet

Abstract

We consider the problem of learning a high-dimensional but low-rank matrix from a large-scale dataset distributed over several machines, where low-rankness is enforced by a convex trace norm constraint. We propose DFW-Trace, a distributed Frank-Wolfe algorithm which leverages the low-rank structure of its updates to achieve efficiency in time, memory and communication usage. The step at the heart of DFW-Trace is solved approximately using a distributed version of the power method. We provide a theoretical analysis of the convergence of DFW-Trace, showing that we can ensure sublinear convergence in expectation to an optimal solution with few power iterations per epoch. We implement DFW-Trace in the Apache Spark distributed programming framework and validate the usefulness of our approach on synthetic and real data, including the ImageNet dataset with high-dimensional features extracted…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

WenjieZ/distributed-frank-wolfe
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.