Tele-Knowledge Pre-training for Fault Analysis
Zhuo Chen, Wen Zhang, Yufeng Huang, Mingyang Chen, Yuxia Geng, Hongtao, Yu, Zhen Bi, Yichi Zhang, Zhen Yao, Wenting Song, Xinliang Wu, Yi Yang,, Mingyi Chen, Zhaoyang Lian, Yingying Li, Lei Cheng, Huajun Chen

TL;DR
This paper introduces TeleBERT and KTeleBERT, pre-trained language models on telecommunication data, which enhance fault analysis tasks like root-cause analysis, event prediction, and fault tracing by integrating tele-knowledge.
Contribution
The paper presents a novel tele-knowledge graph and two domain-specific language models, demonstrating improved fault analysis performance in telecommunication applications.
Findings
Pre-training on tele-related corpora improves downstream fault analysis tasks.
Re-training with tele-knowledge further enhances model performance.
Knowledge injection methods effectively incorporate tele-domain information.
Abstract
In this work, we share our experience on tele-knowledge pre-training for fault analysis, a crucial task in telecommunication applications that requires a wide range of knowledge normally found in both machine log data and product documents. To organize this knowledge from experts uniformly, we propose to create a Tele-KG (tele-knowledge graph). Using this valuable data, we further propose a tele-domain language pre-training model TeleBERT and its knowledge-enhanced version, a tele-knowledge re-training model KTeleBERT. which includes effective prompt hints, adaptive numerical data encoding, and two knowledge injection paradigms. Concretely, our proposal includes two stages: first, pre-training TeleBERT on 20 million tele-related corpora, and then re-training it on 1 million causal and machine-related corpora to obtain KTeleBERT. Our evaluation on multiple tasks related to fault analysis…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImbalanced Data Classification Techniques
