On the Computational Power of Online Gradient Descent

Vaggos Chatziafratis; Tim Roughgarden; Joshua R. Wang

arXiv:1807.01280·cs.LG·February 7, 2019

Open Access

TL;DR

This paper proves that the weight vectors produced by online gradient descent can encode arbitrary polynomial-space computations, even in very simple learning settings. As a result, under weak complexity-theoretic assumptions, no efficient procedure can predict the algorithm's fine-grained behavior.

Contribution

The paper shows that the evolution of weight vectors in online gradient descent can encode arbitrary polynomial-space computations, so questions about its fine-grained behavior are computationally hard to answer.

Findings

01

Online gradient descent can simulate polynomial-space computations.

02

Under weak complexity-theoretic assumptions, the fine-grained behavior of online gradient descent cannot be analyzed efficiently.

03

The hardness result holds even in very simple learning settings, which limits how precisely online learning algorithms can be reasoned about.

Abstract

We prove that the evolution of weight vectors in online gradient descent can encode arbitrary polynomial-space computations, even in very simple learning settings. Our results imply that, under weak complexity-theoretic assumptions, it is impossible to reason efficiently about the fine-grained behavior of online gradient descent.
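To make "the evolution of weight vectors" concrete, the following is a minimal sketch of the standard online gradient descent update, w_{t+1} = w_t - lr * grad(loss_t)(w_t). It is not the paper's construction: the squared loss, the learning rate, and the names `ogd_step` and `run_ogd` are assumptions chosen only for illustration.

```python
from typing import List, Sequence, Tuple

Example = Tuple[Sequence[float], float]  # (feature vector x, target y)


def ogd_step(w: Sequence[float], x: Sequence[float], y: float, lr: float) -> List[float]:
    """One online gradient descent update.

    Assumes squared loss 0.5 * (w.x - y)^2 (an illustrative choice),
    whose gradient with respect to w is (w.x - y) * x.
    """
    prediction = sum(wi * xi for wi, xi in zip(w, x))
    error = prediction - y
    return [wi - lr * error * xi for wi, xi in zip(w, x)]


def run_ogd(examples: Sequence[Example], dim: int, lr: float = 0.1) -> List[List[float]]:
    """Process examples one at a time and record every weight vector."""
    w = [0.0] * dim
    trajectory = [list(w)]
    for x, y in examples:
        w = ogd_step(w, x, y, lr)
        trajectory.append(list(w))
    return trajectory


if __name__ == "__main__":
    # Two identical examples; the first weight moves toward the target.
    data = [([1.0, 0.0], 1.0), ([1.0, 0.0], 1.0)]
    for t, w in enumerate(run_ogd(data, dim=2, lr=0.5)):
        print(t, w)
```

The list returned by `run_ogd` is the sequence of weight vectors whose evolution the abstract refers to. Each step depends on the current weights and the current example only, and the hardness result concerns predicting properties of this sequence without simply running it.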

Taxonomy

Topics: Machine Learning and Algorithms · Advanced Bandit Algorithms Research · Stochastic Gradient Optimization Techniques