Scheduling to Optimize Sojourn Time of Successful Jobs
Yuan Yao, Marco Paolieri, Leana Golubchik

TL;DR
This paper develops scheduling policies that specifically optimize the sojourn time of successful jobs in iterative computing, demonstrating improved performance over existing methods through theoretical proofs and extensive simulations.
Contribution
It introduces a non-preemptive optimal scheduling policy for single-server scenarios and an online approach for multi-server, dynamic environments, focusing on successful job sojourn time.
Findings
Optimal schedules are non-preemptive for single-server cases.
The proposed policy asymptotically minimizes sojourn time of successful jobs.
Simulation results show better performance than existing techniques.
Abstract
Deep neural networks training jobs and other iterative computations frequently include checkpoints where jobs can be canceled based on the current value of monitored metrics. While most of existing results focus on the performance of all jobs (both successfully completed and canceled), in this work we explore scheduling policies that improve the sojourn time of successful jobs, which are typically more valuable to the user. Our model assumes that each job has a known discrete size distribution (e.g., estimated from previous execution logs) where the largest size value indicates a successful completion, while other size values correspond to termination checkpoints. In the single-server case where all jobs are available for scheduling simultaneously, we prove that optimal schedules do not preempt jobs, even when preemption overhead is negligible. Based on this, we develop a scheduling…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Age of Information Optimization · Advanced Bandit Algorithms Research
