DEFault++: Automated Fault Detection, Categorization, and Diagnosis for Transformer Architectures
Sigma Jahan, Saurabh Singh Rajput, Tushar Sharma, Mohammad Masudur Rahman

TL;DR
DEFault++ is a hierarchical, learning-based diagnostic tool that detects, categorizes, and identifies root causes of faults in transformer models, significantly improving fault diagnosis accuracy.
Contribution
It introduces DEFault++, a novel hierarchical diagnostic method and DEFault-bench, a large benchmark dataset for transformer fault analysis.
Findings
Achieves over 0.96 AUROC in fault detection.
Macro-F1 score of 0.85 in fault categorization and diagnosis.
Increases developer diagnosis accuracy from 57.1% to 83.3%.
Abstract
Transformer models are widely deployed in critical AI applications, yet faults in their attention mechanisms, projections, and other internal components often degrade behavior silently without raising runtime errors. Existing fault diagnosis techniques often target generic deep neural networks and cannot identify which transformer component is responsible for an observed symptom. In this article, we present DEFault++, a hierarchical learning-based diagnostic technique that operates at three level of abstraction: it detects whether a fault is present, classifies it into one of 12 transformer-specific fault categories (covering both attention-internal mechanisms and surrounding architectural components), and identifies the underlying root cause from up to 45 mechanisms. To facilitate both training and evaluation, we construct DEFault-bench, a benchmark of 3,739 labeled instances obtained…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
