Knowledge Distillation-Empowered Digital Twin for Anomaly Detection
Qinghua Xu, Shaukat Ali, Tao Yue, Zaimovic Nedim, and Inderjeet Singh

TL;DR
This paper introduces KDDT, a novel digital twin framework utilizing language models, LSTM, and knowledge distillation to enhance anomaly detection in train control systems, achieving high accuracy with limited data.
Contribution
The paper presents a new digital twin method combining language models, LSTM, and knowledge distillation for improved anomaly detection in cyber-physical systems.
Findings
KDDT achieved F1 scores of 0.931 and 0.915 on real industry datasets.
Knowledge distillation contributed an average 6.05% improvement in F1 score.
The combined approach outperforms individual components in anomaly detection accuracy.
Abstract
Cyber-physical systems (CPSs), like train control and management systems (TCMS), are becoming ubiquitous in critical infrastructures. As safety-critical systems, ensuring their dependability during operation is crucial. Digital twins (DTs) have been increasingly studied for this purpose owing to their capability of runtime monitoring and warning, prediction and detection of anomalies, etc. However, constructing a DT for anomaly detection in TCMS necessitates sufficient training data and extracting both chronological and context features with high quality. Hence, in this paper, we propose a novel method named KDDT for TCMS anomaly detection. KDDT harnesses a language model (LM) and a long short-term memory (LSTM) network to extract contexts and chronological features, respectively. To enrich data volume, KDDT benefits from out-of-domain data with knowledge distillation (KD). We evaluated…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsKnowledge Distillation
