RIVA: Leveraging LLM Agents for Reliable Configuration Drift Detection
Sami Abuzakuk, Lucas Crijns, Anne-Marie Kermarrec, Rafael Pires, Martijn de Vos

TL;DR
RIVA is a multi-agent system that enhances the reliability of infrastructure as code verification by effectively handling erroneous tool outputs through collaborative cross-validation, significantly improving accuracy in detecting configuration drift.
Contribution
The paper introduces RIVA, a novel multi-agent framework that improves IaC verification reliability by using collaborative cross-validation to mitigate erroneous tool responses.
Findings
RIVA increases task accuracy from 27.3% to 50% with erroneous responses.
RIVA improves accuracy from 28% to 43.8% without errors.
Cross-validation enhances autonomous infrastructure verification reliability.
Abstract
Infrastructure as code (IaC) tools automate cloud provisioning but verifying that deployed systems remain consistent with the IaC specifications remains challenging. Such configuration drift occurs because of bugs in the IaC specification, manual changes, or system updates. Large language model (LLM)-based agentic AI systems can automate the analysis of large volumes of telemetry data, making them suitable for the detection of configuration drift. However, existing agentic systems implicitly assume that the tools they invoke always return correct outputs, making them vulnerable to erroneous tool responses. Since agents cannot distinguish whether an anomalous tool output reflects a real infrastructure problem or a broken tool, such errors may cause missed drift or false alarms, reducing reliability precisely when it is most needed. We introduce RIVA (Robust Infrastructure by Verification…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware System Performance and Reliability · Data Stream Mining Techniques · Software Testing and Debugging Techniques
