Diagonal Scaling: A Multi-Dimensional Resource Model and Optimization Framework for Distributed Databases
Shahir Abdullah, Syed Rohit Zaman

TL;DR
This paper introduces the Scaling Plane, a two-dimensional resource model for distributed databases, and proposes DIAGONALSCALE, a diagonal scaling algorithm that improves performance and cost efficiency.
Contribution
It presents a novel multi-dimensional scaling model and an optimization algorithm that jointly adjusts node count and resources for better cloud database autoscaling.
Findings
Diagonal scaling reduces p95 latency by up to 40%
It lowers cost-per-query by up to 37%
It achieves 2 to 5 times less rebalancing compared to traditional methods
Abstract
Modern cloud databases present scaling as a binary decision: scale-out by adding nodes or scale-up by increasing per-node resources. This one-dimensional view is limiting because database performance, cost, and coordination overhead emerge from the joint interaction of horizontal elasticity and per-node CPU, memory, network bandwidth, and storage IOPS. As a result, systems often overreact to load spikes, underreact to memory pressure, or oscillate between suboptimal states. We introduce the Scaling Plane, a two-dimensional model in which each distributed database configuration is represented as a point (H, V), with H denoting node count and V a vector of resources. Over this plane, we define smooth approximations of latency, throughput, coordination overhead, and monetary cost, providing a unified view of performance trade-offs. We show analytically and empirically that optimal scaling…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
