IDEAL: Query-Efficient Data-Free Learning from Black-box Models

Jie Zhang; Chen Chen; Lingjuan Lyu

arXiv:2205.11158·cs.AI·August 21, 2023·5 cites

IDEAL: Query-Efficient Data-Free Learning from Black-box Models

Jie Zhang, Chen Chen, Lingjuan Lyu

PDF

Open Access 1 Repo

TL;DR

IDEAL is a novel query-efficient data-free learning method that trains student models from black-box APIs with minimal queries, significantly reducing costs while maintaining high performance.

Contribution

It introduces a two-stage approach for data generation and model distillation that requires only one query per sample, improving efficiency over existing methods.

Findings

01

Outperforms baseline methods on multiple datasets.

02

Reduces query costs by over 50 times.

03

Achieves significant accuracy improvements.

Abstract

Knowledge Distillation (KD) is a typical method for training a lightweight student model with the help of a well-trained teacher model. However, most KD methods require access to either the teacher's training data or model parameters, which is unrealistic. To tackle this problem, recent works study KD under data-free and black-box settings. Nevertheless, these works require a large number of queries to the teacher model, which incurs significant monetary and computational costs. To address these problems, we propose a novel method called \emph{query-effIcient Data-free lEarning from blAck-box modeLs} (IDEAL), which aims to query-efficiently learn from black-box model APIs to train a good student without any real data. In detail, IDEAL trains the student model in two stages: data generation and model distillation. Note that IDEAL does not require any query in the data generation stage…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

SonyResearch/IDEAL
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Data Classification · Advanced Neural Network Applications · Topic Modeling

MethodsKnowledge Distillation