Identifying the Major Sources of Variance in Transaction Latencies: Towards More Predictable Databases
Jiamin Huang, Barzan Mozafari, Grant Schoenebeck, Thomas Wenisch

TL;DR
This paper introduces VProfiler, a framework for identifying sources of variance in transaction latencies in databases, and proposes methods to reduce variance, improving predictability without sacrificing performance.
Contribution
The paper presents VProfiler, a novel profiling tool that decomposes transaction latency variance into function-level components, and applies it to improve MySQL's predictability.
Findings
VProfiler effectively identifies dominant sources of latency variance.
Lock scheduling and logging are primary variance sources in MySQL and Postgres.
Variance reduction techniques decrease latency variance by up to 64%.
Abstract
Decades of research have sought to improve transaction processing performance and scalability in database management systems (DBMSs). However, significantly less attention has been dedicated to the predictability of performance: how often individual transactions exhibit execution latency far from the mean? Performance predictability is vital when transaction processing lies on the critical path of a complex enterprise software or an interactive web service, as well as in emerging database-as-a-service markets where customers contract for guaranteed levels of performance. In this paper, we take several steps towards achieving more predictable database systems. First, we propose a profiling framework called VProfiler that, given the source code of a DBMS, is able to identify the dominant sources of variance in transaction latency. VProfiler automatically instruments the DBMS source code…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCloud Computing and Resource Management · Distributed systems and fault tolerance · Advanced Database Systems and Queries
