Astragalus: Automatic Configuration Repair for Production Networks
Zhenrong Gu, Peng Zhang, Xing Feng, Xu Liu

TL;DR
Astragalus is a syntax-driven tool for automatic network configuration repair, demonstrating high effectiveness and speed in repairing errors in production networks without modeling complex semantics.
Contribution
It introduces a novel syntax-driven approach inspired by automatic program repair, enabling scalable and effective network configuration repairs.
Findings
Successfully repaired all incidents in synthesized networks with 15 error types.
Achieved 97.5% repair success rate on real network incidents.
Average repair time was 7.36 seconds, with some repairs completed in under 6 minutes.
Abstract
Network configurations are prone to errors, which can lead to catastrophic service outages. A tool that can achieve automatic configuration repair (ACR) is highly desired by operators. Existing tools for ACR follow a semantic-driven approach: they model network semantics as a set of SMT constraints, and solve them for a location or fix of the error. Due to the complex semantics of networks, constructing and solving these constraints can be prohibitively expensive, making these tools neither general nor scalable. Inspired by automatic program repair (APR), we explore another direction, i.e., a syntax-driven approach, which tries to repair program bugs by ``grafting'' some existing code in the same repository, without modeling program semantics. Following this direction, we propose Astragalus, a syntax-driven method for ACR. It uses multiple iterations of a ``localize-fix-validate''…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
