Adversarial Robustness through Dynamic Ensemble Learning

Hetvi Waghela; Jaydip Sen; Sneha Rakshit

arXiv:2412.16254·cs.CR·May 14, 2025

Adversarial Robustness through Dynamic Ensemble Learning

Hetvi Waghela, Jaydip Sen, Sneha Rakshit

PDF

Open Access

TL;DR

This paper introduces ARDEL, a dynamic ensemble learning approach that significantly improves the adversarial robustness of pre-trained language models by adaptively reconfiguring ensembles based on input and attack patterns.

Contribution

ARDEL is a novel ensemble scheme that dynamically adjusts model configurations using a meta-model and adversarial detection, enhancing robustness against attacks.

Findings

01

ARDEL outperforms existing defenses in robustness tests.

02

Dynamic reconfiguration reduces attack success rates.

03

Maintains higher accuracy under adversarial conditions.

Abstract

Adversarial attacks pose a significant threat to the reliability of pre-trained language models (PLMs) such as GPT, BERT, RoBERTa, and T5. This paper presents Adversarial Robustness through Dynamic Ensemble Learning (ARDEL), a novel scheme designed to enhance the robustness of PLMs against such attacks. ARDEL leverages the diversity of multiple PLMs and dynamically adjusts the ensemble configuration based on input characteristics and detected adversarial patterns. Key components of ARDEL include a meta-model for dynamic weighting, an adversarial pattern detection module, and adversarial training with regularization techniques. Comprehensive evaluations using standardized datasets and various adversarial attack scenarios demonstrate that ARDEL significantly improves robustness compared to existing methods. By dynamically reconfiguring the ensemble to prioritize the most robust models for…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Fault Detection and Control Systems · Anomaly Detection Techniques and Applications

MethodsGated Linear Unit · Refunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Cosine Annealing · Linear Warmup With Cosine Annealing · Discriminative Fine-Tuning · Linear Layer · Linear Warmup With Linear Decay · Byte Pair Encoding · Dense Connections