BatchBench: Toward a Workload-Aware Benchmark for Autoscaling Policies in Big Data Batch Processing -- A Proposed Framework
Venkata Krishna Prasanth Budigi, Siri Chandana Sirigiri

TL;DR
BatchBench is an open framework designed to standardize the benchmarking of various autoscaling policies in big data batch processing, enabling fair comparison across rule-based, learned, and agentic approaches.
Contribution
The paper introduces the design of BatchBench, a comprehensive benchmarking framework including workload taxonomy, generator, evaluation criteria, and a standardized agent interface.
Findings
Proposed a workload taxonomy of six batch processing classes.
Developed a parameterized workload generator with validation methodology.
Defined a five-axis evaluation harness including cost, SLA, responsiveness, thrash, and interpretability.
Abstract
Autoscaling has become a baseline expectation for cloud-native big data processing, and the design space has expanded beyond rule-based heuristics to include learned controllers and, most recently, large language model (LLM) agents. Yet despite a growing body of work spanning these paradigms, the community lacks a shared benchmark for comparing them. Existing evaluations rely on synthetic TPC-style queries, vendor blog posts with proprietary baselines, or narrow trace replays. Each new policy reports favorable numbers against a different baseline, on a different workload, with a different cost model, making cross-paper comparison effectively impossible. This is a position paper. We propose BatchBench, an open benchmarking framework designed to place rule-based, learned, and agentic autoscaling policies on equal experimental footing. The contribution is the design of the framework, not…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
