10-millisecond Computing
Gang Lu, Jianfeng Zhan, Tianshu Hao, and Lei Wang

TL;DR
This paper introduces the concept of 10-millisecond computing, emphasizing the need for systems to deliver responses within tens of milliseconds, and analyzes the challenges and potential design strategies for achieving this on current hardware and software stacks.
Contribution
It defines 10-millisecond computing, proposes a metric for quality of service, and quantitatively evaluates the challenges in existing systems to meet this latency requirement.
Findings
Reducing outlier proportion to 10% requires an 871X to 2372X reduction in latency outliers.
Current Linux, LXC, and XEN systems need significant improvements to meet 10-ms latency goals.
Discusses design space options across architectures, networking, OS, and benchmarking for 10-ms computing.
Abstract
Despite computation becomes much complex on data with an unprecedented scale, we argue computers or smart devices should and will consistently provide information and knowledge to human being in the order of a few tens milliseconds. We coin a new term 10-millisecond computing to call attention to this class of workloads. 10-millisecond computing raises many challenges for both software and hardware stacks. In this paper, using a typical workload-memcached on a 40-core server (a main-stream server in near future), we quantitatively measure 10-ms computing's challenges to conventional operating systems. For better communication, we propose a simple metric-outlier proportion to measure quality of service: for N completed requests or jobs, if M jobs or requests' latencies exceed the outlier threshold t, the outlier proportion is M/N . For a 1K-scale system running Linux (version 2.6.32),…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCloud Computing and Resource Management · Parallel Computing and Optimization Techniques · Quantum Computing Algorithms and Architecture
