ADMoE: Anomaly Detection with Mixture-of-Experts from Noisy Labels
Yue Zhao, Guoqing Zheng, Subhabrata Mukherjee, Robert McCann, Ahmed, Awadallah

TL;DR
ADMoE introduces a novel mixture-of-experts framework that effectively learns from noisy labels for anomaly detection, significantly improving performance over existing methods across multiple datasets.
Contribution
This work presents the first anomaly detection framework that leverages noisy labels using a mixture-of-experts architecture, enabling scalable and specialized learning from multiple noisy sources.
Findings
Up to 34% performance improvement over baseline methods.
Outperforms 13 leading anomaly detection baselines.
Effective on diverse datasets including enterprise security data.
Abstract
Existing works on anomaly detection (AD) rely on clean labels from human annotators that are expensive to acquire in practice. In this work, we propose a method to leverage weak/noisy labels (e.g., risk scores generated by machine rules for detecting malware) that are cheaper to obtain for anomaly detection. Specifically, we propose ADMoE, the first framework for anomaly detection algorithms to learn from noisy labels. In a nutshell, ADMoE leverages mixture-of-experts (MoE) architecture to encourage specialized and scalable learning from multiple noisy sources. It captures the similarities among noisy labels by sharing most model parameters, while encouraging specialization by building "expert" sub-networks. To further juice out the signals from noisy labels, ADMoE uses them as input features to facilitate expert learning. Extensive results on eight datasets (including a proprietary…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsAnomaly Detection Techniques and Applications · Advanced Malware Detection Techniques · Network Security and Intrusion Detection
