Exact Gaussian Moment Matching for Residual Networks: a Second-Order Method
Simon Kuang, Xinfan Lin

TL;DR
This paper introduces an exact second-order moment matching method for propagating Gaussian distributions through residual neural networks, significantly improving inference accuracy over existing methods.
Contribution
It derives exact moment matching formulas for various activation functions in residual networks, enabling more accurate uncertainty propagation.
Findings
Orders-of-magnitude reduction in KL divergence error on random networks.
Hundredfold improvement in KL divergence on variational Bayes neural networks.
Provides theoretical error bounds showing improved local accuracy.
Abstract
We study the problem of propagating the mean and covariance of a general multivariate Gaussian distribution through a deep (residual) neural network using layer-by-layer moment matching. We close a longstanding gap by deriving exact moment matching for the probit, GeLU, ReLU (as a limit of GeLU), Heaviside (as a limit of probit), and sine activation functions; for both feedforward and generalized residual layers. On random networks, we find orders-of-magnitude improvements in the KL divergence error metric, up to a millionfold, over popular alternatives. On a variational Bayes neural network, we show that our method attains hundredfold improvements in KL divergence from Monte Carlo ground truth over a state-of-the-art deterministic inference method. We also give a smooth-distance error bound showing that, under regularity assumptions, moment matching removes the leading low-variance…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
