RADEP: A Resilient Adaptive Defense Framework Against Model Extraction Attacks

Amit Chakraborty; Sayyed Farid Ahamed; Sandip Roy; Soumya Banerjee; Kevin Choi; Abdul Rahman; Alison Hu; Edward Bowen; Sachin Shetty

arXiv:2505.19364·cs.CR·May 27, 2025

RADEP: A Resilient Adaptive Defense Framework Against Model Extraction Attacks

Amit Chakraborty, Sayyed Farid Ahamed, Sandip Roy, Soumya Banerjee, Kevin Choi, Abdul Rahman, Alison Hu, Edward Bowen, Sachin Shetty

PDF

TL;DR

RADEP is a comprehensive defense framework that enhances the security of MLaaS models against extraction attacks by combining adversarial training, query detection, adaptive responses, and ownership verification, maintaining performance for legitimate users.

Contribution

This paper introduces RADEP, a novel multi-layered defense framework that effectively counters model extraction attacks with adaptive mechanisms and minimal impact on legitimate queries.

Findings

01

RADEP significantly reduces attack success rates.

02

High detection accuracy for malicious queries.

03

Resilient against adaptive adversaries.

Abstract

Machine Learning as a Service (MLaaS) enables users to leverage powerful machine learning models through cloud-based APIs, offering scalability and ease of deployment. However, these services are vulnerable to model extraction attacks, where adversaries repeatedly query the application programming interface (API) to reconstruct a functionally similar model, compromising intellectual property and security. Despite various defense strategies being proposed, many suffer from high computational costs, limited adaptability to evolving attack techniques, and a reduction in performance for legitimate users. In this paper, we introduce a Resilient Adaptive Defense Framework for Model Extraction Attack Protection (RADEP), a multifaceted defense framework designed to counteract model extraction attacks through a multi-layered security approach. RADEP employs progressive adversarial training to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.