Carbon-Aware Quality Adaptation for Energy-Intensive Services
Philipp Wiesner, Dennis Grinwald, Philipp Wei{\ss}, Patrick Wilhelm, Ramin Khalili, Odej Kao

TL;DR
This paper proposes a novel approach to reduce the carbon footprint of energy-intensive cloud services by dynamically adjusting service quality based on grid carbon intensity, achieving significant emissions savings.
Contribution
It introduces a forecast-based multi-horizon optimization method for adaptive service quality management to minimize carbon emissions in real-time.
Findings
Up to 10% reduction in CO2 emissions for large-scale LLM services.
Effective adaptation of service quality can complement resource efficiency strategies.
The approach is applicable under constraints requiring constant service availability.
Abstract
The energy demand of modern cloud services, particularly those related to generative AI, is increasing at an unprecedented pace. To date, carbon-aware computing strategies have primarily focused on batch process scheduling or geo-distributed load balancing. However, such approaches are not applicable to services that require constant availability at specific locations due to latency, privacy, data, or infrastructure constraints. In this paper, we explore how the carbon footprint of energy-intensive services can be reduced by adjusting the fraction of requests served by different service quality tiers. We show that adapting this quality of responses with respect to grid carbon intensity can lead to additional carbon savings beyond resource and energy efficiency. Building on this, we introduce a forecast-based multi-horizon optimization that reaches close-to-optimal carbon savings and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGreen IT and Sustainability · Cloud Computing and Resource Management · Caching and Content Delivery
