On Seven Fundamental Optimization Challenges in Machine Learning
Konstantin Mishchenko

TL;DR
This paper addresses seven key optimization challenges in machine learning, providing theoretical guarantees, new algorithms, and improved methods for federated learning, adaptive optimization, and distributed systems.
Contribution
It introduces novel solutions and theoretical insights for seven fundamental optimization problems inspired by machine learning and federated learning.
Findings
First theoretical guarantees for Local SGD in heterogeneous data regimes
Closing the gap for Random Reshuffling and Shuffle-Once algorithms
Development of FedRR algorithm that outperforms gradient descent in communication efficiency
Abstract
Many recent successes of machine learning went hand in hand with advances in optimization. The exchange of ideas between these fields has worked both ways, with machine learning building on standard optimization procedures such as gradient descent, as well as with new directions in the optimization theory stemming from machine learning applications. In this thesis, we discuss new developments in optimization inspired by the needs and practice of machine learning, federated learning, and data science. In particular, we consider seven key challenges of mathematical optimization and develop a solution to each. Our first contribution is the resolution of a key open problem in Federated Learning: we establish the first theoretical guarantees for the famous Local SGD algorithm in the heterogeneous data regime. As the second challenge, we close the gap between the upper and lower bounds for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques · Cooperative Communication and Network Coding
