Leveraging Large Language Model for Intelligent Log Processing and Autonomous Debugging in Cloud AI Platforms
Cheng Ji, Huaiying Luo

TL;DR
This paper presents an innovative framework leveraging large language models for intelligent log processing and autonomous debugging in cloud AI platforms, significantly enhancing fault localization accuracy and system self-repair capabilities.
Contribution
It introduces a novel LLM-based debugging framework with multi-stage semantic inference, dynamic log structuring, and reinforcement learning-driven decision-making for cloud AI systems.
Findings
Improves fault localization accuracy by 16.2%.
Enhances semantic understanding and adaptive debugging.
Supports heterogeneous cloud environments.
Abstract
As AI systems on cloud platforms grow in scale and complexity, the log data they generate during operation is massive, unstructured, and semantically ambiguous, which makes fault localization and system self-repair difficult. To address this problem, this paper proposes an intelligent log processing and automatic debugging framework based on a Large Language Model (LLM), named Intelligent Debugger (LLM-ID). The method extends an existing pre-trained Transformer model and integrates a multi-stage semantic inference mechanism to understand system logs in context and automatically reconstruct fault chains. First, the system log is dynamically structured, and an unsupervised clustering and embedding mechanism is used to extract event templates and semantic schemas. Subsequently, the fine-tuned…
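Illustration
The log-structuring step mentioned in the abstract (turning raw log lines into event templates) can be pictured with a minimal sketch. This is not the paper's method: the abstract describes unsupervised clustering and embeddings, whereas the sketch below substitutes plain regex masking as a simpler stand-in. The masking rules, function names, and sample log lines are all hypothetical and chosen only for illustration.

```python
import re
from collections import defaultdict

# Hypothetical masking rules (illustrative assumptions, not from the paper).
# Order matters: IP addresses are masked before bare numbers.
_MASKS = [
    (re.compile(r"\b\d{1,3}(?:\.\d{1,3}){3}\b"), "<IP>"),
    (re.compile(r"\b0x[0-9a-fA-F]+\b"), "<HEX>"),
    (re.compile(r"\b\d+\b"), "<NUM>"),
]


def to_template(line: str) -> str:
    """Replace variable-looking tokens with placeholders."""
    for pattern, placeholder in _MASKS:
        line = pattern.sub(placeholder, line)
    return line


def group_by_template(lines):
    """Map each template to the raw log lines that produced it."""
    groups = defaultdict(list)
    for line in lines:
        groups[to_template(line)].append(line)
    return dict(groups)


# Made-up sample log lines for demonstration only.
logs = [
    "worker 12 lost connection to 10.0.0.5",
    "worker 7 lost connection to 10.0.0.9",
    "GPU 3 out of memory at 0x7f3a",
]
templates = group_by_template(logs)
for template, members in templates.items():
    print(f"{len(members)}x  {template}")
```

Lines that differ only in their variable fields collapse into one template, which is the kind of structured input a downstream model could reason over. A clustering-based approach, as the abstract describes, would aim to discover such templates without hand-written rules.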
Taxonomy
Methods: Layer Normalization · Dropout · Absolute Position Encodings · Dense Connections · Byte Pair Encoding · Softmax · Label Smoothing · Transformer
