From Fewer Samples to Fewer Bits: Reframing Dataset Distillation as Joint Optimization of Precision and Compactness

My H. Dinh; Aditya Sant; Akshay Malhotra; Keya Patani; Shahab Hamidi-Rad

arXiv:2603.02411·cs.CV·March 4, 2026

From Fewer Samples to Fewer Bits: Reframing Dataset Distillation as Joint Optimization of Precision and Compactness

My H. Dinh, Aditya Sant, Akshay Malhotra, Keya Patani, Shahab Hamidi-Rad

PDF

Open Access

TL;DR

This paper introduces QuADD, a novel dataset distillation framework that jointly optimizes dataset size and data precision, leading to more efficient training data representations with better accuracy per bit.

Contribution

It presents a unified, end-to-end approach integrating quantization into dataset distillation, enabling joint optimization of sample count and precision under fixed bit budgets.

Findings

01

QuADD outperforms existing methods in accuracy per bit.

02

Adaptive non-uniform quantization improves data representation.

03

Joint optimization enhances efficiency in image and communication tasks.

Abstract

Dataset Distillation (DD) compresses large datasets into compact synthetic ones that maintain training performance. However, current methods mainly target sample reduction, with limited consideration of data precision and its impact on efficiency. We propose Quantization-aware Dataset Distillation (QuADD), a unified framework that jointly optimizes dataset compactness and precision under fixed bit budgets. QuADD integrates a differentiable quantization module within the distillation loop, enabling end-to-end co-optimization of synthetic samples and quantization parameters. Guided by the rate-distortion perspective, we empirically analyze how bit allocation between sample count and precision influences learning performance. Our framework supports both uniform and adaptive non-uniform quantization, where the latter learns quantization levels from data to represent information-dense…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Adversarial Robustness in Machine Learning