Growing Tiny Networks: Spotting Expressivity Bottlenecks and Fixing Them Optimally
Manon Verbockhaven (TAU, LISN), Sylvain Chevallier (TAU, LISN),, Guillaume Charpiat (TAU, LISN), Th\'eo Rudkiewicz (LISN, A\&O, TAU)

TL;DR
This paper introduces a method to dynamically adapt neural network architectures during training by detecting and fixing expressivity bottlenecks, enabling smaller networks to achieve large-model performance efficiently.
Contribution
It proposes a mathematical framework to identify and address expressivity bottlenecks in neural networks during training, allowing for architecture adaptation and reducing the need for large, fixed networks.
Findings
Achieves CIFAR accuracy comparable to large networks with smaller architectures.
Reduces reliance on extensive hyper-parameter tuning.
Demonstrates efficient training with adaptive architecture modifications.
Abstract
Machine learning tasks are generally formulated as optimization problems, where one searches for an optimal function within a certain functional space. In practice, parameterized functional spaces are considered, in order to be able to perform gradient descent. Typically, a neural network architecture is chosen and fixed, and its parameters (connection weights) are optimized, yielding an architecture-dependent result. This way of proceeding however forces the evolution of the function during training to lie within the realm of what is expressible with the chosen architecture, and prevents any optimization across architectures. Costly architectural hyper-parameter optimization is often performed to compensate for this. Instead, we propose to adapt the architecture on the fly during training. We show that the information about desirable architectural changes, due to expressivity…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Neural Networks and Applications · Machine Learning in Materials Science
