Formal Specification for Fast ACS: Low-Latency File-Based Ordered Message Delivery at Scale
Sushant Kumar Gupta, Anil Raghunath Iyer, Chang Yu, Neel Bagora, Olivier Pomerleau, Vivek Kumar, Prunthaban Kanthakumar

TL;DR
Fast ACS is a low-latency, file-based ordered message delivery system designed for large-scale, geographically distributed clusters, ensuring efficient, guaranteed message delivery with minimal resource usage.
Contribution
The paper introduces Fast ACS, a novel file-based messaging system that combines inter- and intra-cluster communication primitives to achieve scalable, low-latency delivery at global scale.
Findings
Successfully deployed in dozens of production clusters
Scales to thousands of consumers with Tbps intra-cluster traffic
Delivers messages within seconds or sub-seconds (p99)
Abstract
Low-latency message delivery is crucial for real-time systems. Data originating from a producer must be delivered to consumers, potentially distributed in clusters across metropolitan and continental boundaries. With the growing scale of computing, there can be several thousand consumers of the data. Such systems require a robust messaging system capable of transmitting messages containing data across clusters and efficiently delivering them to consumers. The system must offer guarantees like ordering and at-least-once delivery while avoiding overload on consumers, allowing them to consume messages at their own pace. This paper presents the design of Fast ACS (an abbreviation for Ads Copy Service), a file-based ordered message delivery system that leverages a combination of two-sided (inter-cluster) and one-sided (intra-cluster) communication primitives - namely, Remote Procedure Call…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReal-Time Systems Scheduling · Distributed and Parallel Computing Systems · Distributed systems and fault tolerance
