Nonuniform random feature models using derivative information

Konstantin Pieper; Zezhong Zhang; Guannan Zhang

arXiv:2410.02132·cs.LG·October 4, 2024

TL;DR

This paper introduces nonuniform, data-driven parameter distributions for neural network initialization that leverage derivative information, improving upon traditional uniform random feature models in regression tasks.

Contribution

The paper develops nonuniform parameter distributions, derived from derivative data of the target function, for initializing shallow neural networks so that the sampled neurons are well suited to modeling the function's local derivatives.

Findings

01

The densities concentrate in regions of parameter space whose neurons are well suited to modeling the local derivatives of the unknown function.

02

Sampling efficiency is improved with approximate derivative data.

03

Performance approaches that of optimal trained networks.

Abstract

We propose nonuniform data-driven parameter distributions for neural network initialization based on derivative data of the function to be approximated. These parameter distributions are developed in the context of non-parametric regression models based on shallow neural networks, and compare favorably to well-established uniform random feature models based on conventional weight initialization. We address the cases of Heaviside and ReLU activation functions, and their smooth approximations (sigmoid and softplus), and use recent results on the harmonic analysis and sparse representation of neural networks resulting from fully trained optimal networks. Extending analytic results that give exact representation, we obtain densities that concentrate in regions of the parameter space corresponding to neurons that are well suited to model the local derivatives of the unknown function. Based…
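To make the contrast between uniform and derivative-informed sampling concrete, here is a minimal one-dimensional sketch. It is an illustration under stated assumptions, not the paper's construction: it relies only on the standard identity f(x) = f(a) + f'(a)(x - a) + ∫ f''(t) relu(x - t) dt on an interval [a, b], which suggests drawing ReLU breakpoints with density proportional to |f''|. The target function, grid, feature count, and all function names (`relu_features`, `sample_breakpoints`, `fit_rmse`) are hypothetical choices made for this example, and the second derivative is approximated by finite differences.

```python
import numpy as np


def relu_features(x, breakpoints):
    """Design matrix with columns [1, x, relu(x - t_1), ..., relu(x - t_n)]."""
    hinge = np.maximum(x[:, None] - breakpoints[None, :], 0.0)
    return np.column_stack([np.ones_like(x), x, hinge])


def sample_breakpoints(rng, n, grid, weights=None):
    """Draw n ReLU breakpoints: uniform on the interval if weights is None,
    otherwise from the grid points with probability proportional to weights."""
    if weights is None:
        return rng.uniform(grid[0], grid[-1], size=n)
    p = weights / weights.sum()
    return rng.choice(grid, size=n, p=p)


def fit_rmse(x, y, breakpoints):
    """Fit only the outer (linear) weights by least squares; return training RMSE."""
    A = relu_features(x, breakpoints)
    coef, *_ = np.linalg.lstsq(A, y, rcond=None)
    return float(np.sqrt(np.mean((A @ coef - y) ** 2)))


rng = np.random.default_rng(0)
grid = np.linspace(-1.0, 1.0, 2001)

# Hypothetical target with curvature localized near x = 0 (assumption for the demo).
y = np.tanh(20.0 * grid)

# Approximate |f''| from samples with repeated finite differences.
curvature = np.abs(np.gradient(np.gradient(y, grid), grid))

n_features = 20
t_uniform = sample_breakpoints(rng, n_features, grid)
t_nonuniform = sample_breakpoints(rng, n_features, grid, weights=curvature)

rmse_uniform = fit_rmse(grid, y, t_uniform)
rmse_nonuniform = fit_rmse(grid, y, t_nonuniform)

print(f"uniform breakpoints:    RMSE = {rmse_uniform:.2e}")
print(f"nonuniform breakpoints: RMSE = {rmse_nonuniform:.2e}")
```

In this sketch only the outer weights are fitted, as in a random feature model; the inner parameters stay at their sampled values. The nonuniform draw places nearly all breakpoints in the narrow transition region of the target, which is the one-dimensional analogue of a density that concentrates where neurons can model local derivatives.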

Taxonomy

Topics: Image Processing and 3D Reconstruction