AutoScout: Structured Optimization for Automating ML System Configuration
Jimmy Shong, Yuhan Ding, Yihan Jiang, Liheng Jing, Haonan Chen, Gaokai Zhang, Aditya Akella, Fan Lai

TL;DR
AutoScout is a versatile system that automates ML configuration tuning by formulating it as a hybrid optimization problem, significantly improving training speed across various models and hardware.
Contribution
It introduces a hybrid optimization framework for ML system configuration that handles hierarchical dependencies and reduces profiling costs, outperforming existing heuristics.
Findings
Achieves 2.7-3.0× training speedup over expert tuning.
Effectively handles heterogeneous feature types and dependencies.
Reduces profiling cost through adaptive feature prioritization.
Abstract
Machine learning (ML) systems expose a rapidly expanding configuration space spanning model-parallelism strategies, communication optimizations, and low-level runtime parameters. End-to-end system efficiency is highly sensitive to these choices, yet identifying high-performance configurations is challenging due to heterogeneous feature types (e.g., sparse and dense parameters), conditional dependencies (e.g., valid execution parameters only under specific upstream decisions), and the high search (profiling) cost. Existing approaches either optimize a narrow subset of configuration dimensions or rely on ad-hoc heuristics that fail to generalize as configuration spaces continue to grow. We present AutoScout, a general-purpose systems configurator for ML training, fine-tuning, and inference. It formulates the system configuration as a mixed-discrete/continuous optimization problem with…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification · Parallel Computing and Optimization Techniques · Advanced Neural Network Applications
