LASER: Language Model Regression for Semi-Structured Workflow Resource and Runtime Estimation
Yuxuan Yin, Shengke Zhou, Yunjie Zhang, Ajay Mohindra, Boxun Xu, Peng Li

TL;DR
LASER fine-tunes large language models to accurately predict resource usage and runtime for semi-structured cloud workflow jobs, improving efficiency and surpassing existing methods.
Contribution
The paper introduces LASER, a novel framework that leverages LLMs with specialized encoding and decoding techniques for multi-target regression on complex workflow configurations.
Findings
LASER outperforms human experts and SOTA tabular ML baselines.
Scientific notation encoding improves regression accuracy across multiple magnitudes.
Constrained decoding reduces inference latency by over 30%.
Abstract
Accurate prediction of resource consumption and runtime for cloud workflow jobs is critical for scheduling efficiency, yet remains challenging due to the semi-structured nature of job configurations -- comprising shell commands, tool-specific parameters, dependency graphs, and hierarchical metadata. Traditional ML approaches require brittle feature engineering to flatten this rich information into fixed-size vectors, losing critical semantic context. We present LASER, a framework that fine-tunes LLMs on serialized workflow job configurations for multi-target resource and runtime regression. To address the challenges of numerical regression via generation, we introduce scientific notation output encoding for targets spanning multiple orders of magnitude, and constrained decoding with prefix filling to enforce output validity while reducing inference latency by over 30%. We further show…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware System Performance and Reliability · Machine Learning in Materials Science · Cloud Computing and Resource Management
