Stealix: Model Stealing via Prompt Evolution

Zhixiong Zhuang; Hui-Po Wang; Maria-Irina Nicolae; Mario Fritz

arXiv:2506.05867·cs.CR·June 9, 2025

Stealix: Model Stealing via Prompt Evolution

Zhixiong Zhuang, Hui-Po Wang, Maria-Irina Nicolae, Mario Fritz

PDF

Open Access

TL;DR

Stealix introduces a novel prompt evolution method using genetic algorithms to perform model stealing without predefined prompts, significantly increasing attack efficiency and scalability against open-source pre-trained models.

Contribution

This work presents the first prompt-free approach for model stealing that infers data distribution and refines synthetic data generation without prior prompt knowledge.

Findings

01

Stealix outperforms existing methods in accuracy and diversity of generated data.

02

It operates effectively without class name knowledge or manual prompt crafting.

03

The approach demonstrates high scalability and threat potential of pre-trained models.

Abstract

Model stealing poses a significant security risk in machine learning by enabling attackers to replicate a black-box model without access to its training data, thus jeopardizing intellectual property and exposing sensitive information. Recent methods that use pre-trained diffusion models for data synthesis improve efficiency and performance but rely heavily on manually crafted prompts, limiting automation and scalability, especially for attackers with little expertise. To assess the risks posed by open-source pre-trained models, we propose a more realistic threat model that eliminates the need for prompt design skills or knowledge of class names. In this context, we introduce Stealix, the first approach to perform model stealing without predefined prompts. Stealix uses two open-source pre-trained models to infer the victim model's data distribution, and iteratively refines prompts…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Advanced Malware Detection Techniques · Security and Verification in Computing

MethodsDiffusion