Universal approximation and model compression for radial neural networks

Iordan Ganev; Twan van Laarhoven; Robin Walters

arXiv:2107.02550·cs.LG·February 17, 2023·1 cites

Universal approximation and model compression for radial neural networks

Iordan Ganev, Twan van Laarhoven, Robin Walters

PDF

Open Access 1 Repo

TL;DR

This paper introduces radial neural networks with norm-dependent activation functions, proves their universal approximation capabilities, and develops a lossless model compression method leveraging their symmetry properties.

Contribution

It extends neural network theory by establishing universal approximation for radial networks and proposes a novel, symmetry-based model compression technique.

Findings

01

Radial neural networks can approximate any continuous function on bounded domains.

02

A symmetry-based lossless compression algorithm for radial networks is developed.

03

Optimization on compressed models is equivalent to projected gradient descent on full models.

Abstract

We introduce a class of fully-connected neural networks whose activation functions, rather than being pointwise, rescale feature vectors by a function depending only on their norm. We call such networks radial neural networks, extending previous work on rotation equivariant networks that considers rescaling activations in less generality. We prove universal approximation theorems for radial neural networks, including in the more difficult cases of bounded widths and unbounded domains. Our proof techniques are novel, distinct from those in the pointwise case. Additionally, radial neural networks exhibit a rich group of orthogonal change-of-basis symmetries on the vector space of trainable parameters. Factoring out these symmetries leads to a practical lossless model compression algorithm. Optimization of the compressed model by gradient descent is equivalent to projected gradient descent…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ivganev/QR-decomposition-radial-NNs
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsModel Reduction and Neural Networks · Neural Networks and Applications · Matrix Theory and Algorithms