CarbonFlex: Enabling Carbon-aware Provisioning and Scheduling for Cloud Clusters
Walid A. Hanafy, Li Wu, David Irwin, Prashant Shenoy

TL;DR
CarbonFlex is a novel cloud cluster scheduling system that reduces carbon emissions by approximately 57% through adaptive, data-driven resource provisioning and scheduling tailored for delay-tolerant batch jobs.
Contribution
It introduces a cluster-level, learning-based approach for carbon-aware resource provisioning and scheduling, addressing the gap in prior work focused on individual jobs.
Findings
Reduces carbon emissions by ~57% compared to baseline.
Achieves near-optimal performance within 2.1% of an oracle scheduler.
Effectively manages multiple parallel jobs in cloud clusters.
Abstract
Accelerating computing demand, largely from AI applications, has led to concerns about its carbon footprint. Fortunately, a significant fraction of computing demand comes from batch jobs that are often delay-tolerant and elastic, which enables schedulers to reduce carbon by suspending/resuming jobs and scaling their resources down/up when carbon is high/low. However, prior work on carbon-aware scheduling generally focuses on optimizing carbon for individual jobs in the cloud, and not provisioning and scheduling resources for many parallel jobs in cloud clusters. To address the problem, we present CarbonFlex, a carbon-aware resource provisioning and scheduling approach for cloud clusters. CarbonFlex leverages continuous learning over historical cluster-level data to drive near-optimal runtime resource provisioning and job scheduling. We implement CarbonFlex by extending AWS…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
