Imitating the Functionality of Image-to-Image Models Using a Single Example
Nurit Spingarn-Eliezer, Tomer Michaeli

TL;DR
This paper demonstrates that it is possible to imitate the functionality of proprietary image-to-image translation models using only a single input-output example, through a simple distillation method, even when the original model is unknown.
Contribution
It introduces a method for mimicking image-to-image models from minimal data, revealing vulnerabilities in model confidentiality and providing insights into model imitation.
Findings
Single example often suffices for imitation
Imitation works across various architectures and datasets
Simple distillation can replicate model functionality
Abstract
We study the possibility of imitating the functionality of an image-to-image translation model by observing input-output pairs. We focus on cases where training the model from scratch is impossible, either because training data are unavailable or because the model architecture is unknown. This is the case, for example, with commercial models for biological applications. Since the development of these models requires large investments, their owners commonly keep them confidential, and reveal only a few input-output examples on the company's website or in an academic paper. Surprisingly, we find that even a single example typically suffices for learning to imitate the model's functionality, and that this can be achieved using a simple distillation approach. We present an extensive ablation study encompassing a wide variety of model architectures, datasets and tasks, to characterize the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques
MethodsFocus
