Agon: A Scalable Competitive Scheduler for Large Heterogeneous Systems
Andreas Prodromou, Ashish Venkat, Dean M. Tullsen

TL;DR
Agon is a neural network-based scheduler that efficiently balances accuracy and overhead in large heterogeneous multicore systems, significantly improving performance.
Contribution
Introduces Agon, a scalable, neural network-based scheduler that adapts to system noise and balances scheduler complexity for optimal performance.
Findings
Achieves 6% average performance improvement.
Approaches 99.1% of oracle scheduler performance.
Effectively manages noise in performance predictions.
Abstract
This work proposes a competitive scheduling approach, designed to scale to large heterogeneous multicore systems. This scheduler overcomes the challenges of (1) the high computation overhead of near-optimal schedulers, and (2) the error introduced by inaccurate performance predictions. This paper presents Agon, a neural network-based classifier that selects from a range of schedulers, from simple to very accurate, and learns which scheduler provides the right balance of accuracy and overhead for each scheduling interval. Agon also employs a de-noising frontend allowing the individual schedulers to be tolerant towards noise in performance predictions, producing better overall schedules. By avoiding expensive scheduling overheads, Agon improves average system performance by 6\% on average, approaching the performance of an oracular scheduler (99.1% of oracle performance).
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsParallel Computing and Optimization Techniques · Distributed and Parallel Computing Systems · Advanced Data Storage Technologies
