Hemingway: Modeling Distributed Optimization Algorithms
Xinghao Pan, Shivaram Venkataraman, Zizheng Tai, Joseph Gonzalez

TL;DR
Hemingway is a proposed ML-based system that models distributed optimization algorithms to automatically select suitable algorithms and cluster sizes, improving performance and convergence in machine learning tasks.
Contribution
The paper introduces Hemingway, a prototype system that models system and convergence characteristics to optimize distributed algorithm selection and cluster sizing.
Findings
Preliminary results demonstrate potential of Hemingway in selecting optimal configurations.
Modeling system and convergence behaviors aids in improving distributed optimization efficiency.
Challenges include accurately capturing system dynamics and convergence patterns.
Abstract
Distributed optimization algorithms are widely used in many industrial machine learning applications. However choosing the appropriate algorithm and cluster size is often difficult for users as the performance and convergence rate of optimization algorithms vary with the size of the cluster. In this paper we make the case for an ML-optimizer that can select the appropriate algorithm and cluster size to use for a given problem. To do this we propose building two models: one that captures the system level characteristics of how computation, communication change as we increase cluster sizes and another that captures how convergence rates change with cluster sizes. We present preliminary results from our prototype implementation called Hemingway and discuss some of the challenges involved in developing such a system.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Advanced Bandit Algorithms Research · Complexity and Algorithms in Graphs
