Adversarial Robustness through Dynamic Ensemble Learning
Hetvi Waghela, Jaydip Sen, Sneha Rakshit

TL;DR
This paper introduces ARDEL, a dynamic ensemble learning approach that significantly improves the adversarial robustness of pre-trained language models by adaptively reconfiguring ensembles based on input and attack patterns.
Contribution
ARDEL is a novel ensemble scheme that dynamically adjusts model configurations using a meta-model and adversarial detection, enhancing robustness against attacks.
Findings
ARDEL outperforms existing defenses in robustness tests.
Dynamic reconfiguration reduces attack success rates.
Maintains higher accuracy under adversarial conditions.
Abstract
Adversarial attacks pose a significant threat to the reliability of pre-trained language models (PLMs) such as GPT, BERT, RoBERTa, and T5. This paper presents Adversarial Robustness through Dynamic Ensemble Learning (ARDEL), a novel scheme designed to enhance the robustness of PLMs against such attacks. ARDEL leverages the diversity of multiple PLMs and dynamically adjusts the ensemble configuration based on input characteristics and detected adversarial patterns. Key components of ARDEL include a meta-model for dynamic weighting, an adversarial pattern detection module, and adversarial training with regularization techniques. Comprehensive evaluations using standardized datasets and various adversarial attack scenarios demonstrate that ARDEL significantly improves robustness compared to existing methods. By dynamically reconfiguring the ensemble to prioritize the most robust models for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Fault Detection and Control Systems · Anomaly Detection Techniques and Applications
MethodsGated Linear Unit · Refunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Cosine Annealing · Linear Warmup With Cosine Annealing · Discriminative Fine-Tuning · Linear Layer · Linear Warmup With Linear Decay · Byte Pair Encoding · Dense Connections
