Evaluating the Efficacy of LLM-Based Reasoning for Multiobjective HPC Job Scheduling
Prachi Jadhav, Hongwei Jin, Ewa Deelman, Prasanna Balaprakash

TL;DR
This paper explores the use of Large Language Models with a ReAct framework for multiobjective HPC job scheduling, demonstrating their ability to balance conflicting goals and adapt to diverse workloads with interpretability.
Contribution
It introduces a novel LLM-based scheduler that uses iterative reasoning and natural language feedback, marking a first step towards intelligent, adaptable HPC scheduling solutions.
Findings
LLM-based scheduler balances multiple objectives effectively.
The approach offers transparent reasoning through natural language traces.
It adapts well to diverse and complex HPC workloads.
Abstract
High-Performance Computing (HPC) job scheduling involves balancing conflicting objectives such as minimizing makespan, reducing wait times, optimizing resource use, and ensuring fairness. Traditional methods, including heuristic-based, e.g., First-Come-First-Served (FJFS) and Shortest Job First (SJF), or intensive optimization techniques, often lack adaptability to dynamic workloads and, more importantly, cannot simultaneously optimize multiple objectives in HPC systems. To address this, we propose a novel Large Language Model (LLM)-based scheduler using a ReAct-style framework (Reason + Act), enabling iterative, interpretable decision-making. The system incorporates a scratchpad memory to track scheduling history and refine decisions via natural language feedback, while a constraint enforcement module ensures feasibility and safety. We evaluate our approach using OpenAI's O4-Mini and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDistributed and Parallel Computing Systems · Scheduling and Optimization Algorithms · Service-Oriented Architecture and Web Services
