Enhancing Accuracy and Parameter-Efficiency of Neural Representations   for Network Parameterization

Hongjun Choi; Jayaraman J. Thiagarajan; Ruben Glatt; Shusen Liu

arXiv:2407.00356·cs.LG·July 2, 2024

Enhancing Accuracy and Parameter-Efficiency of Neural Representations for Network Parameterization

Hongjun Choi, Jayaraman J. Thiagarajan, Ruben Glatt, Shusen Liu

PDF

Open Access

TL;DR

This paper investigates the trade-off between accuracy and parameter efficiency in neural network weight parameterization, proposing a novel training scheme that improves both aspects simultaneously.

Contribution

It introduces a new training method that decouples reconstruction from auxiliary objectives, enhancing accuracy and efficiency in neural network representations.

Findings

01

Weight reconstruction alone can recover original model accuracy.

02

Decoupling objectives improves weight reconstruction under efficiency constraints.

03

Proposed approach outperforms state-of-the-art methods in accuracy and efficiency.

Abstract

In this work, we investigate the fundamental trade-off regarding accuracy and parameter efficiency in the parameterization of neural network weights using predictor networks. We present a surprising finding that, when recovering the original model accuracy is the sole objective, it can be achieved effectively through the weight reconstruction objective alone. Additionally, we explore the underlying factors for improving weight reconstruction under parameter-efficiency constraints, and propose a novel training scheme that decouples the reconstruction objective from auxiliary objectives such as knowledge distillation that leads to significant improvements compared to state-of-the-art approaches. Finally, these results pave way for more practical scenarios, where one needs to achieve improvements on both model accuracy and predictor network parameter-efficiency simultaneously.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications · Advanced Neural Network Applications · Machine Learning in Materials Science

MethodsKnowledge Distillation