From Hard to Soft: Understanding Deep Network Nonlinearities via Vector Quantization and Statistical Inference
Randall Balestriero, Richard G. Baraniuk

TL;DR
This paper extends the theoretical understanding of deep network nonlinearities by linking them to vector quantization and Gaussian Mixture Models, enabling new interpretations and improvements in neural network design.
Contribution
It generalizes the MASO framework to include sigmoid, tanh, and softmax nonlinearities through probabilistic GMMs, and introduces a hybrid VQ inference method for enhanced neural network performance.
Findings
ReLU and max-pooling relate to hard VQ inference.
Sigmoid, tanh, softmax relate to soft VQ inference.
Orthogonal filters improve deep network performance.
Abstract
Nonlinearity is crucial to the performance of a deep (neural) network (DN). To date there has been little progress understanding the menagerie of available nonlinearities, but recently progress has been made on understanding the r\^ole played by piecewise affine and convex nonlinearities like the ReLU and absolute value activation functions and max-pooling. In particular, DN layers constructed from these operations can be interpreted as {\em max-affine spline operators} (MASOs) that have an elegant link to vector quantization (VQ) and -means. While this is good theoretical progress, the entire MASO approach is predicated on the requirement that the nonlinearities be piecewise affine and convex, which precludes important activation functions like the sigmoid, hyperbolic tangent, and softmax. {\em This paper extends the MASO framework to these and an infinitely large class of new…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage and Signal Denoising Methods · Remote-Sensing Image Classification · Advanced Image Fusion Techniques
Methods*Communicated@Fast*How Do I Communicate to Expedia?
