IDEAL: Query-Efficient Data-Free Learning from Black-box Models
Jie Zhang, Chen Chen, Lingjuan Lyu

TL;DR
IDEAL is a novel query-efficient data-free learning method that trains student models from black-box APIs with minimal queries, significantly reducing costs while maintaining high performance.
Contribution
It introduces a two-stage approach for data generation and model distillation that requires only one query per sample, improving efficiency over existing methods.
Findings
Outperforms baseline methods on multiple datasets.
Reduces query costs by over 50 times.
Achieves significant accuracy improvements.
Abstract
Knowledge Distillation (KD) is a typical method for training a lightweight student model with the help of a well-trained teacher model. However, most KD methods require access to either the teacher's training data or model parameters, which is unrealistic. To tackle this problem, recent works study KD under data-free and black-box settings. Nevertheless, these works require a large number of queries to the teacher model, which incurs significant monetary and computational costs. To address these problems, we propose a novel method called \emph{query-effIcient Data-free lEarning from blAck-box modeLs} (IDEAL), which aims to query-efficiently learn from black-box model APIs to train a good student without any real data. In detail, IDEAL trains the student model in two stages: data generation and model distillation. Note that IDEAL does not require any query in the data generation stage…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification · Advanced Neural Network Applications · Topic Modeling
MethodsKnowledge Distillation
