Knowledge Distillation for Anomaly Detection
Adrian Alan Pol, Ekaterina Govorkova, Sonja Gronroos, Nadezda, Chernyavskaya, Philip Harris, Maurizio Pierini, Isobel Ojalvo, Peter Elmer

TL;DR
This paper introduces a knowledge distillation approach to compress unsupervised anomaly detection models into smaller, supervised models suitable for resource-constrained devices, maintaining performance while reducing size.
Contribution
It proposes a novel knowledge distillation method for unsupervised anomaly detection models and techniques to enhance detection sensitivity.
Findings
Compressed models match larger models in detection performance
Significant reduction in model size and memory usage
Effective deployment on resource-limited devices
Abstract
Unsupervised deep learning techniques are widely used to identify anomalous behaviour. The performance of such methods is a product of the amount of training data and the model size. However, the size is often a limiting factor for the deployment on resource-constrained devices. We present a novel procedure based on knowledge distillation for compressing an unsupervised anomaly detection model into a supervised deployable one and we suggest a set of techniques to improve the detection sensitivity. Compressed models perform comparably to their larger counterparts while significantly reducing the size and memory footprint.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAnomaly Detection Techniques and Applications · Network Security and Intrusion Detection · Advanced Malware Detection Techniques
MethodsKnowledge Distillation
