Dropout Robustness and Cognitive Profiling of Transformer Models via Stochastic Inference
Ant\^onio Junior Alves Caiado, Michael Hahsler

TL;DR
This paper systematically evaluates dropout-induced variability in 19 transformer models using MC Dropout, revealing architecture-dependent robustness and providing insights for uncertainty-aware applications.
Contribution
It introduces the first comprehensive MC Dropout benchmark for transformers and a cognitive profiling framework to analyze model robustness and performance components.
Findings
Smaller models are prediction-stable; medium-sized models show volatility.
Over half of the models suffer significant accuracy loss under dropout.
Memory tasks are more affected by dropout than reasoning tasks.
Abstract
Transformer-based language models are widely deployed for reasoning, yet their behavior under inference-time stochasticity remains underexplored. While dropout is common during training, its inference-time effects via Monte Carlo sampling lack systematic evaluation across architectures, limiting understanding of model reliability in uncertainty-aware applications. This work analyzes dropout-induced variability across 19 transformer models using MC Dropout with 100 stochastic forward passes per sample. Dropout robustness is defined as maintaining high accuracy and stable predictions under stochastic inference, measured by standard deviation of per-run accuracies. A cognitive decomposition framework disentangles performance into memory and reasoning components. Experiments span five dropout configurations yielding 95 unique evaluations on 1,000 samples. Results reveal substantial…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Materials Science · Explainable Artificial Intelligence (XAI) · Big Data and Digital Economy
