Turning Privacy Constraints into Syslog Analysis Advantage
Siavash Ghiasvand, Florina M. Ciorba, Wolfgang E. Nagel

TL;DR
This paper presents a method that transforms privacy constraints in syslog analysis into an advantage, enabling efficient failure detection in HPC systems while protecting user privacy.
Contribution
It introduces a privacy-preserving syslog analysis approach that reduces storage and processing time, turning privacy constraints into an analytical benefit.
Findings
Significant reduction in storage space needed for syslogs
Processing time is three times shorter
Effective early failure detection in HPC systems
Abstract
The mean time between failures (MTBF) of HPC systems is rapidly reducing, and that current failure recovery mechanisms e.g., checkpoint-restart, will no longer be able to recover the systems from failures. Early failure detection is a new class of failure recovery methods that can be beneficial for HPC systems with short MTBF. System logs (syslogs) are invaluable source of information which give us a deep insight about system behavior, and make the early failure detection possible. Beside normal information, syslogs contain sensitive data which might endanger users' privacy. Even though analyzing various syslogs is necessary for creating a general failure detection/prediction method, privacy concerns discourage system administrators to publish syslogs. Herein, we ensure user privacy via de-identifying syslogs, and then turning the applied constraint for addressing users' privacy into an…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDistributed systems and fault tolerance · Cognitive Functions and Memory · Age of Information Optimization
