Data Integrity Error Localization in Networked Systems with Missing Data
Yufeng Xin, Shih-Wen Fu, Anirban Mandal, Ryan Tanaka, Mats Rynge,, Karan Vahi, Ewa Deelman

TL;DR
This paper introduces a novel multi-output prediction model for network failure localization in wide-area networks, effectively handling missing data through multivariate imputation, and demonstrating significant performance improvements.
Contribution
It presents the first application of imputation techniques to address missing data in network failure localization, enhancing accuracy in wide-area network diagnosis.
Findings
Multivariate imputation improves failure localization accuracy.
The model effectively handles incomplete measurement data.
Significant performance gains with selected imputation algorithms.
Abstract
Most recent network failure diagnosis systems focused on data center networks where complex measurement systems can be deployed to derive routing information and ensure network coverage in order to achieve accurate and fast fault localization. In this paper, we target wide-area networks that support data-intensive distributed applications. We first present a new multi-output prediction model that directly maps the application level observations to localize the system component failures. In reality, this application-centric approach may face the missing data challenge as some input (feature) data to the inference models may be missing due to incomplete or lost measurements in wide area networks. We show that the presented prediction model naturally allows the {\it multivariate} imputation to recover the missing data. We evaluate multiple imputation algorithms and show that the prediction…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware System Performance and Reliability · Traffic Prediction and Management Techniques · Anomaly Detection Techniques and Applications
