Seeing Through Black Boxes : Tracking Transactions through Queues under Monitoring Resource Constraints
Animashree Anandkumar, Ting He, Chatschik Bisdikian, and Dakshi, Agrawal

TL;DR
This paper addresses the challenge of optimally allocating limited monitoring resources in distributed queueing systems to improve transaction tracking accuracy, balancing between precise identifiers and imprecise timestamps.
Contribution
It introduces two heuristics for resource allocation based on predicting timestamp accuracy, with theoretical conditions for near-optimality and validation through simulations.
Findings
Heuristics perform close to optimal in simulations.
Optimal allocation depends on queueing dynamics and resource constraints.
Conditions for heuristic optimality are derived and validated.
Abstract
The problem of optimal allocation of monitoring resources for tracking transactions progressing through a distributed system, modeled as a queueing network, is considered. Two forms of monitoring information are considered, viz., locally unique transaction identifiers, and arrival and departure timestamps of transactions at each processing queue. The timestamps are assumed available at all the queues but in the absence of identifiers, only enable imprecise tracking since parallel processing can result in out-of-order departures. On the other hand, identifiers enable precise tracking but are not available without proper instrumentation. Given an instrumentation budget, only a subset of queues can be selected for production of identifiers, while the remaining queues have to resort to imprecise tracking using timestamps. The goal is then to optimally allocate the instrumentation budget to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Queuing Theory Analysis · Distributed systems and fault tolerance · Age of Information Optimization
