Hestia: Hyperthread-Level Scheduling for Cloud Microservices with Interference-Aware Attention
Dingyu Yang, Fanyong Kong, Jie Dai, Shiyou Qian, Shuangwei Li, Jian Cao, Guangtao Xue, Gang Chen

TL;DR
Hestia is a novel hyperthread-level, interference-aware scheduler for cloud microservices that uses self-attention to model contention patterns, significantly reducing latency and improving resource efficiency.
Contribution
Hestia introduces a self-attention-based predictor and interference scoring model for hyperthread-level scheduling, addressing asymmetric heterogeneity and SMT contention in cloud environments.
Findings
Reduces 95th-percentile latency by up to 80%
Lowers CPU consumption by 2.3% under same workload
Outperforms five state-of-the-art schedulers by up to 30.65%
Abstract
Modern cloud servers routinely co-locate multiple latency-sensitive microservice instances to improve resource efficiency. However, the diversity of microservice behaviors, coupled with mutual performance interference under simultaneous multithreading (SMT), makes large-scale placement increasingly complex. Existing interference aware schedulers and isolation techniques rely on coarse core-level profiling or static resource partitioning, leaving asymmetric hyperthread-level heterogeneity and SMT contention dynamics largely unmodeled. We present Hestia, a hyperthread-level, interference-aware scheduling framework powered by self-attention. Through an extensive analysis of production traces encompassing 32,408 instances across 3,132 servers, we identify two dominant contention patterns -- sharing-core (SC) and sharing-socket (SS) -- and reveal strong asymmetry in their impact. Guided by…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware System Performance and Reliability · Cloud Computing and Resource Management · Green IT and Sustainability
