HESSO: Towards Automatic Efficient and User Friendly Any Neural Network Training and Pruning
Tianyi Chen, Xiaoyi Qu, David Aponte, Colby Banbury, Jongwoo Ko,, Tianyu Ding, Yong Ma, Vladimir Lyapunov, Ilya Zharkov, Luming Liang

TL;DR
HESSO is a new optimizer that automatically trains and prunes neural networks efficiently, reducing human intervention and preventing performance collapse, applicable across diverse AI tasks.
Contribution
The paper introduces HESSO, a nearly tuning-free, automatic optimizer for neural network pruning, and CRIC, a cycle to prevent performance collapse during pruning.
Findings
HESSO achieves competitive or superior performance compared to state-of-the-art methods.
HESSO-CRIC effectively prevents irreversible performance collapse.
Applicable to various neural network architectures and tasks.
Abstract
Structured pruning is one of the most popular approaches to effectively compress the heavy deep neural networks (DNNs) into compact sub-networks while retaining performance. The existing methods suffer from multi-stage procedures along with significant engineering efforts and human expertise. The Only-Train-Once (OTO) series has been recently proposed to resolve the many pain points by streamlining the workflow by automatically conducting (i) search space generation, (ii) structured sparse optimization, and (iii) sub-network construction. However, the built-in sparse optimizers in the OTO series, i.e., the Half-Space Projected Gradient (HSPG) family, have limitations that require hyper-parameter tuning and the implicit controls of the sparsity exploration, consequently requires intervening by human expertise. To address such limitations, we propose a Hybrid Efficient Structured Sparse…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications
MethodsPruning
