Green or Fast? Learning to Balance Cold Starts and Idle Carbon in Serverless Computing

Bowen Sun; Christos D. Antonopoulos; Evgenia Smirni; Bin Ren; Nikolaos Bellas; Spyros Lalis

arXiv:2602.23935·cs.DC·March 2, 2026

Green or Fast? Learning to Balance Cold Starts and Idle Carbon in Serverless Computing

Bowen Sun, Christos D. Antonopoulos, Evgenia Smirni, Bin Ren, Nikolaos Bellas, Spyros Lalis

PDF

Open Access

TL;DR

This paper introduces LACE-RL, a deep reinforcement learning framework that dynamically balances cold-start latency and idle carbon emissions in serverless computing, adapting to real-time conditions.

Contribution

LACE-RL is the first to formulate serverless pod retention as a sequential decision problem using deep RL, optimizing both latency and carbon emissions jointly.

Findings

01

Reduces cold starts by 51.69% compared to static policies.

02

Cuts idle keep-alive carbon emissions by 77.08%.

03

Achieves better latency-carbon trade-offs than existing heuristics.

Abstract

Serverless computing simplifies cloud deployment but introduces new challenges in managing service latency and carbon emissions. Reducing cold-start latency requires retaining warm function instances, while minimizing carbon emissions favors reclaiming idle resources. This balance is further complicated by time-varying grid carbon intensity and varying workload patterns, under which static keep-alive policies are inefficient. We present LACE-RL, a latency-aware and carbon-efficient management framework that formulates serverless pod retention as a sequential decision problem. LACE-RL uses deep reinforcement learning to dynamically tune keep-alive durations, jointly modeling cold-start probability, function-specific latency costs, and real-time carbon intensity. Using the Huawei Public Cloud Trace, we show that LACE-RL reduces cold starts by 51.69% and idle keep-alive carbon emissions by…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsCloud Computing and Resource Management · IoT and Edge/Fog Computing · Distributed and Parallel Computing Systems