New Tools for Peak Memory Scheduling
Ce Jin, Manish Purohit, Zoya Svitkina, Erik Vee, Joshua R. Wang

TL;DR
This paper introduces new tools for scheduling computation graphs to minimize peak memory, proving the existence of dominant schedules, analyzing their complexity, and providing efficient algorithms for specific graph classes.
Contribution
It defines dominant schedules for computation graphs, proves their existence, analyzes complexity, and develops fixed-parameter algorithms for series-parallel graphs.
Findings
Dominant schedules exist for all computation graphs.
Weighted one-shot black pebbling is NP-complete even for simple graphs.
Fixed-parameter tractable algorithms are developed for series-parallel graphs.
Abstract
We study scheduling of computation graphs to minimize peak memory consumption, an increasingly critical task due to the surge in popularity of large deep-learning models. This problem corresponds to the weighted version of the classical one-shot black pebbling game. We propose the notion of a dominant schedule to capture the idea of finding the ``best'' schedule for a subgraph and introduce new tools to compute and utilize dominant schedules. Surprisingly, we show that despite the strong requirements, a dominant schedule exists for any computation graph; and, moreover, that it is possible to compute the dominant schedule efficiently whenever we can find optimal schedules efficiently for a particular class of graphs (under mild technical conditions). We apply these new tools to analyze trees and series-parallel graphs. We show that the weighted one-shot black pebbling game is strongly…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
New Tools for Peak Memory Scheduling· youtube
Taxonomy
TopicsParallel Computing and Optimization Techniques · Graph Theory and Algorithms · Complexity and Algorithms in Graphs
