Diagnosing Distributed Systems through Log Data Analysis
K. R. Chowdhary, Rajendra Purohit

TL;DR
This paper explores log data analysis techniques for diagnosing distributed systems, addressing challenges posed by the lack of direct happen-before relations, and proposes solutions for effective performance analysis.
Contribution
It introduces methods for log-based performance diagnosis in distributed systems, highlighting challenges and offering novel solutions for improved analysis accuracy.
Findings
Effective log analysis methods for centralized systems
Identified challenges in distributed system log analysis
Proposed solutions improve performance diagnosis accuracy
Abstract
The log-based analysis and trouble-shooting has remained prevalent and commonly used approach for centralized and time-haring systems. However, for parallel and distributed systems where happen-before relations are not directly available between the events, it become a challenge to fully depend on log-based analysis in such instances. This article attempts to provide solutions using log-based performance analysis of centralized system, and demonstrates the results and their effectiveness, as well presents the challenges and proposes solutions for performance analysis in distributed and parallel systems.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
