Enhancing Accuracy and Parameter-Efficiency of Neural Representations for Network Parameterization
Hongjun Choi, Jayaraman J. Thiagarajan, Ruben Glatt, Shusen Liu

TL;DR
This paper investigates the trade-off between accuracy and parameter efficiency in neural network weight parameterization, proposing a novel training scheme that improves both aspects simultaneously.
Contribution
It introduces a new training method that decouples reconstruction from auxiliary objectives, enhancing accuracy and efficiency in neural network representations.
Findings
Weight reconstruction alone can recover original model accuracy.
Decoupling objectives improves weight reconstruction under efficiency constraints.
Proposed approach outperforms state-of-the-art methods in accuracy and efficiency.
Abstract
In this work, we investigate the fundamental trade-off regarding accuracy and parameter efficiency in the parameterization of neural network weights using predictor networks. We present a surprising finding that, when recovering the original model accuracy is the sole objective, it can be achieved effectively through the weight reconstruction objective alone. Additionally, we explore the underlying factors for improving weight reconstruction under parameter-efficiency constraints, and propose a novel training scheme that decouples the reconstruction objective from auxiliary objectives such as knowledge distillation that leads to significant improvements compared to state-of-the-art approaches. Finally, these results pave way for more practical scenarios, where one needs to achieve improvements on both model accuracy and predictor network parameter-efficiency simultaneously.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Advanced Neural Network Applications · Machine Learning in Materials Science
MethodsKnowledge Distillation
