Deadline-Aware Online Scheduling for LLM Fine-Tuning with Spot Market Predictions
Linggao Kong, Yuedong Xu, Lei Jiao, Chuan Xu

TL;DR
This paper presents an online scheduling framework for cost-efficient LLM fine-tuning using spot and on-demand GPU instances, leveraging market predictions and adaptive policy selection to handle market volatility.
Contribution
It introduces a prediction-based online scheduling algorithm with a novel commitment level and a policy selection method, improving cost and deadline adherence in volatile spot markets.
Findings
Prediction-based algorithm achieves tighter bounds with lower errors.
Policy selection algorithm has a regret of O(√T).
Framework outperforms baselines, increasing utility by up to 54.8%.
Abstract
As foundation models grow in size, fine-tuning them becomes increasingly expensive. While GPU spot instances offer a low-cost alternative to on-demand resources, their volatile prices and availability make deadline-aware scheduling particularly challenging. We tackle this difficulty by using a mix of spot and on-demand instances. Distinctively, we show the predictability of prices and availability in a spot instance market, the power of prediction in enabling cost-efficient scheduling and its sensitivity to estimation errors. An integer programming problem is formulated to capture the use of mixed instances under both the price and availability dynamics. We propose an online allocation algorithm with prediction based on the committed horizon control approach that leverages a \emph{commitment level} to enforce the partial sequence of decisions. When this prediction becomes inaccurate, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsParallel Computing and Optimization Techniques · Constraint Satisfaction and Optimization · Embedded Systems Design Techniques
