ThreatFormer-IDS: Robust Transformer Intrusion Detection with Zero-Day Generalization and Explainable Attribution
Srikumar Nayak

TL;DR
ThreatFormer-IDS is a Transformer-based intrusion detection system that enhances robustness, zero-day attack generalization, and explainability in IoT and industrial networks through advanced learning and attribution techniques.
Contribution
It introduces a novel Transformer framework combining supervised, self-supervised, adversarial training, and attribution for robust, explainable intrusion detection.
Findings
Achieves high detection accuracy on IoT benchmarks.
Maintains strong performance on unseen attack types.
Demonstrates robustness against adversarial feature manipulations.
Abstract
Intrusion detection in IoT and industrial networks requires models that can detect rare attacks at low false-positive rates while remaining reliable under evolving traffic and limited labels. Existing IDS solutions often report strong in-distribution accuracy, but they may degrade when evaluated on future traffic, unseen (zero-day) attack families, or adversarial feature manipulations, and many systems provide limited evidence to support analyst triage. To address these gaps, we propose ThreatFormer- IDS, a Transformer-based sequence modeling framework that converts flow records into time-ordered windows and learns contextual representations for robust intrusion screening. The method combines (i) weighted supervised learning for imbalanced detection, (ii) masked self-supervised learning to improve representation stability under drift and sparse labels, (iii) PGDbased adversarial…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNetwork Security and Intrusion Detection · Anomaly Detection Techniques and Applications · Adversarial Robustness in Machine Learning
