Learn Like The Pro: Norms from Theory to Size Neural Computation

Margaret Trautner; Ziwei Li; Sai Ravela

arXiv:2106.11409·cs.LG·June 23, 2021·1 cites

Learn Like The Pro: Norms from Theory to Size Neural Computation

Margaret Trautner, Ziwei Li, Sai Ravela

PDF

Open Access

TL;DR

This paper introduces a theory-based method to determine neural network sizes using a Learnability metric derived from dynamical systems, enabling size estimation without training data.

Contribution

It proposes a novel, training-free approach to estimate neural network sizes based on dynamical system norms, bridging theory and neural network design.

Findings

01

Provides exact sizing for neural networks with multiplicative nodes.

02

Offers tight lower bounds for classical feed-forward networks.

03

The approach aligns well with simulated assessments.

Abstract

The optimal design of neural networks is a critical problem in many applications. Here, we investigate how dynamical systems with polynomial nonlinearities can inform the design of neural systems that seek to emulate them. We propose a Learnability metric and its associated features to quantify the near-equilibrium behavior of learning dynamics. Equating the Learnability of neural systems with equivalent parameter estimation metric of the reference system establishes bounds on network structure. In this way, norms from theory provide a good first guess for neural structure, which may then further adapt with data. The proposed approach neither requires training nor training data. It reveals exact sizing for a class of neural networks with multiplicative nodes that mimic continuous- or discrete-time polynomial dynamics. It also provides relatively tight lower size bounds for classical…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications · Model Reduction and Neural Networks · Machine Learning and Algorithms