Efficient Two-Layered Monitor for Partially Synchronous Distributed Systems (Technical Report)
Vidhya Tekken Valapil, Sandeep Kulkarni, Eric Torng, and Gabe Appleton

TL;DR
This paper introduces a novel two-layered monitoring system for distributed systems that significantly reduces monitoring costs by combining an efficient imprecise layer with a precise but less efficient layer, enabling the use of small clocks.
Contribution
The paper proposes a new two-layered monitoring approach that overcomes time and space limitations of previous methods, enabling efficient and precise system correctness monitoring.
Findings
Reduces monitoring costs by 85-95%.
Allows use of O(1) sized Hybrid Logical Clocks.
Balances efficiency and precision effectively.
Abstract
Monitoring distributed systems to ensure their correctness is a challenging and expensive but essential problem. It is challenging because while execution of a distributed system creates a partial order among events, the monitor will typically observe only one serialization of that partial order. This means that even if the observed serialization is consistent with the system specifications, the monitor cannot assume that the system is correct because some other unobserved serialization can be inconsistent with the system specifications. Existing solutions that guarantee identification of all such unobserved violations require some combination of lots of time and large clocks, e.g. O(n) sized Vector Clocks. We present a new, efficient two-layered monitoring approach that overcomes both the time and space limitations of earlier monitors. The first layer is imprecise but efficient and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDistributed systems and fault tolerance · Real-Time Systems Scheduling · Parallel Computing and Optimization Techniques
