Conflicting Bundles: Adapting Architectures Towards the Improved   Training of Deep Neural Networks

David Peer; Sebastian Stabinger; Antonio Rodriguez-Sanchez

arXiv:2011.02956·cs.LG·November 6, 2020

Conflicting Bundles: Adapting Architectures Towards the Improved Training of Deep Neural Networks

David Peer, Sebastian Stabinger, Antonio Rodriguez-Sanchez

PDF

Open Access 1 Repo

TL;DR

This paper introduces a new theory and metric to identify and remove conflicting layers in neural networks early in training, improving performance, reducing memory, and speeding up inference.

Contribution

It presents a novel theoretical framework and algorithm for automatically detecting and removing performance-degrading layers in neural networks.

Findings

01

Identified layers that worsen performance due to conflicting training bundles

02

Proposed an algorithm that automatically removes such layers

03

Resulting architectures achieve competitive accuracy with reduced memory and faster inference

Abstract

Designing neural network architectures is a challenging task and knowing which specific layers of a model must be adapted to improve the performance is almost a mystery. In this paper, we introduce a novel theory and metric to identify layers that decrease the test accuracy of the trained models, this identification is done as early as at the beginning of training. In the worst-case, such a layer could lead to a network that can not be trained at all. More precisely, we identified those layers that worsen the performance because they produce conflicting training bundles as we show in our novel theoretical analysis, complemented by our extensive empirical studies. Based on these findings, a novel algorithm is introduced to remove performance decreasing layers automatically. Architectures found by this algorithm achieve a competitive accuracy when compared against the state-of-the-art…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

peerdavid/conflicting-bundles
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Machine Learning and Data Classification · Advanced Neural Network Applications

Methods(TravEL!!Guide)How Do I File a Claim with Expedia? · Tanh Activation · + ( 1 ) ⟷ 888 ⟷ ( 829 ) ⟷ 0881 How do I file a claim with Expedia?