RADEP: A Resilient Adaptive Defense Framework Against Model Extraction Attacks
Amit Chakraborty, Sayyed Farid Ahamed, Sandip Roy, Soumya Banerjee, Kevin Choi, Abdul Rahman, Alison Hu, Edward Bowen, Sachin Shetty

TL;DR
RADEP is a comprehensive defense framework that enhances the security of MLaaS models against extraction attacks by combining adversarial training, query detection, adaptive responses, and ownership verification, maintaining performance for legitimate users.
Contribution
This paper introduces RADEP, a novel multi-layered defense framework that effectively counters model extraction attacks with adaptive mechanisms and minimal impact on legitimate queries.
Findings
RADEP significantly reduces attack success rates.
High detection accuracy for malicious queries.
Resilient against adaptive adversaries.
Abstract
Machine Learning as a Service (MLaaS) enables users to leverage powerful machine learning models through cloud-based APIs, offering scalability and ease of deployment. However, these services are vulnerable to model extraction attacks, where adversaries repeatedly query the application programming interface (API) to reconstruct a functionally similar model, compromising intellectual property and security. Despite various defense strategies being proposed, many suffer from high computational costs, limited adaptability to evolving attack techniques, and a reduction in performance for legitimate users. In this paper, we introduce a Resilient Adaptive Defense Framework for Model Extraction Attack Protection (RADEP), a multifaceted defense framework designed to counteract model extraction attacks through a multi-layered security approach. RADEP employs progressive adversarial training to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
