Natural-Parameter Networks: A Class of Probabilistic Neural Networks
Hao Wang, Xingjian Shi, Dit-Yan Yeung

TL;DR
Natural-Parameter Networks (NPN) introduce a flexible probabilistic neural network framework that models weights and neurons with exponential-family distributions, enabling Bayesian learning and improved performance on real-world tasks.
Contribution
NPN provides a novel lightweight Bayesian neural network approach that allows arbitrary exponential-family distributions for weights and neurons, with efficient backpropagation for natural parameters.
Findings
Achieves state-of-the-art performance on real-world datasets
Supports arbitrary exponential-family distributions for model components
Enables second-order representations for tasks like link prediction
Abstract
Neural networks (NN) have achieved state-of-the-art performance in various applications. Unfortunately in applications where training data is insufficient, they are often prone to overfitting. One effective way to alleviate this problem is to exploit the Bayesian approach by using Bayesian neural networks (BNN). Another shortcoming of NN is the lack of flexibility to customize different distributions for the weights and neurons according to the data, as is often done in probabilistic graphical models. To address these problems, we propose a class of probabilistic neural networks, dubbed natural-parameter networks (NPN), as a novel and lightweight Bayesian treatment of NN. NPN allows the usage of arbitrary exponential-family distributions to model the weights and neurons. Different from traditional NN and BNN, NPN takes distributions as input and goes through layers of transformation…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification · Anomaly Detection Techniques and Applications · Gaussian Processes and Bayesian Inference
