ResNEsts and DenseNEsts: Block-based DNN Models with Improved Representation Guarantees
Kuan-Lin Chen, Ching-Hua Lee, Harinath Garudadri, Bhaskar D. Rao

TL;DR
This paper introduces ResNEsts and DenseNEsts, novel block-based deep neural network models that improve representation guarantees and address feature reuse issues, with DenseNEsts outperforming ResNets without architectural redesign.
Contribution
It defines ResNEsts by removing nonlinearities at the last residual, introduces A-ResNEst for performance guarantees, and proposes DenseNEst, a DenseNet-inspired model with better properties.
Findings
ResNEsts guarantee performance with added blocks when expanded.
A-ResNEst provides empirical risk lower bounds for ResNEsts.
DenseNEsts outperform ResNets without architectural changes.
Abstract
Models recently used in the literature proving residual networks (ResNets) are better than linear predictors are actually different from standard ResNets that have been widely used in computer vision. In addition to the assumptions such as scalar-valued output or single residual block, these models have no nonlinearities at the final residual representation that feeds into the final affine layer. To codify such a difference in nonlinearities and reveal a linear estimation property, we define ResNEsts, i.e., Residual Nonlinear Estimators, by simply dropping nonlinearities at the last residual representation from standard ResNets. We show that wide ResNEsts with bottleneck blocks can always guarantee a very desirable training property that standard ResNets aim to achieve, i.e., adding more blocks does not decrease performance given the same set of basis elements. To prove that, we first…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Domain Adaptation and Few-Shot Learning · Advanced Neural Network Applications
Methods*Communicated@Fast*How Do I Communicate to Expedia? · Average Pooling · 1x1 Convolution · guidence~How to file a complaint against Expedia? · Residual Connection · Convolution · Global Average Pooling · Dense Connections · Softmax · Batch Normalization
