TimeTrader: Exploiting Latency Tail to Save Datacenter Energy for On-line Data-Intensive Applications
Balajee Vamanan, Hamza Bin Sohail, Jahangir Hasan, T. N. Vijaykumar

TL;DR
TimeTrader reduces datacenter energy consumption for online data-intensive applications by exploiting latency slack in sub-critical replies, reshaping response time distribution without increasing missed deadlines, and achieving significant energy savings over previous methods.
Contribution
It introduces TimeTrader, a novel approach that exploits latency slack in all loads by slowing down nodes without missing deadlines, outperforming prior load-based energy saving techniques.
Findings
Achieves 15-19% energy savings at 90% load.
Achieves 41-49% energy savings at 30% load.
Outperforms previous work in energy efficiency without increasing missed deadlines.
Abstract
Datacenters running on-line, data-intensive applications (OLDIs) consume significant amounts of energy. However, reducing their energy is challenging due to their tight response time requirements. A key aspect of OLDIs is that each user query goes to all or many of the nodes in the cluster, so that the overall time budget is dictated by the tail of the replies' latency distribution; replies see latency variations both in the network and compute. Previous work proposes to achieve load-proportional energy by slowing down the computation at lower datacenter loads based directly on response times (i.e., at lower loads, the proposal exploits the average slack in the time budget provisioned for the peak load). In contrast, we propose TimeTrader to reduce energy by exploiting the latency slack in the sub- critical replies which arrive before the deadline (e.g., 80% of replies are 3-4x faster…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCloud Computing and Resource Management · Distributed and Parallel Computing Systems · Parallel Computing and Optimization Techniques
