ASCAI: Adaptive Sampling for acquiring Compact AI
Mojan Javaheripi, Mohammad Samragh, Tara Javidi, Farinaz, Koushanfar

TL;DR
ASCAI introduces an adaptive sampling method inspired by genetic algorithms to efficiently optimize deep neural network compression hyperparameters, achieving better accuracy and compression trade-offs on resource-limited devices.
Contribution
The paper presents a novel adaptive sampling approach for hyperparameter tuning in DNN compression, outperforming existing rule-based and reinforcement learning methods.
Findings
ASCAI achieves higher compression rates with maintained accuracy.
It outperforms rule-based and reinforcement learning methods.
The method effectively navigates large hyperparameter search spaces.
Abstract
This paper introduces ASCAI, a novel adaptive sampling methodology that can learn how to effectively compress Deep Neural Networks (DNNs) for accelerated inference on resource-constrained platforms. Modern DNN compression techniques comprise various hyperparameters that require per-layer customization to ensure high accuracy. Choosing such hyperparameters is cumbersome as the pertinent search space grows exponentially with the number of model layers. To effectively traverse this large space, we devise an intelligent sampling mechanism that adapts the sampling strategy using customized operations inspired by genetic algorithms. As a special case, we consider the space of model compression as a vector space. The adaptively selected samples enable ASCAI to automatically learn how to tune per-layer compression hyperparameters to optimize the accuracy/model-size trade-off. Our extensive…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification · Adversarial Robustness in Machine Learning · Advanced Neural Network Applications
