Stealix: Model Stealing via Prompt Evolution
Zhixiong Zhuang, Hui-Po Wang, Maria-Irina Nicolae, Mario Fritz

TL;DR
Stealix introduces a novel prompt evolution method using genetic algorithms to perform model stealing without predefined prompts, significantly increasing attack efficiency and scalability against open-source pre-trained models.
Contribution
This work presents the first prompt-free approach for model stealing that infers data distribution and refines synthetic data generation without prior prompt knowledge.
Findings
Stealix outperforms existing methods in accuracy and diversity of generated data.
It operates effectively without class name knowledge or manual prompt crafting.
The approach demonstrates high scalability and threat potential of pre-trained models.
Abstract
Model stealing poses a significant security risk in machine learning by enabling attackers to replicate a black-box model without access to its training data, thus jeopardizing intellectual property and exposing sensitive information. Recent methods that use pre-trained diffusion models for data synthesis improve efficiency and performance but rely heavily on manually crafted prompts, limiting automation and scalability, especially for attackers with little expertise. To assess the risks posed by open-source pre-trained models, we propose a more realistic threat model that eliminates the need for prompt design skills or knowledge of class names. In this context, we introduce Stealix, the first approach to perform model stealing without predefined prompts. Stealix uses two open-source pre-trained models to infer the victim model's data distribution, and iteratively refines prompts…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Advanced Malware Detection Techniques · Security and Verification in Computing
MethodsDiffusion
