Auto-tuning of Deep Neural Networks by Conflicting Layer Removal
David Peer, Sebastian Stabinger, Antonio Rodriguez-Sanchez

TL;DR
This paper introduces a method to identify and remove conflicting layers in neural networks, leading to simpler architectures with comparable accuracy and reduced resource consumption.
Contribution
It presents a novel approach to detect conflicting layers early in training and a NAS algorithm to optimize network architecture by removing such layers.
Findings
Up to 60% of layers can be removed without significant accuracy loss.
The proposed NAS algorithm effectively identifies conflicting layers early.
Architectures with removed conflicting layers achieve competitive accuracy with less memory and faster inference.
Abstract
Designing neural network architectures is a challenging task and knowing which specific layers of a model must be adapted to improve the performance is almost a mystery. In this paper, we introduce a novel methodology to identify layers that decrease the test accuracy of trained models. Conflicting layers are detected as early as the beginning of training. In the worst-case scenario, we prove that such a layer could lead to a network that cannot be trained at all. A theoretical analysis is provided on what is the origin of those layers that result in a lower overall network performance, which is complemented by our extensive empirical evaluation. More precisely, we identified those layers that worsen the performance because they would produce what we name conflicting training bundles. We will show that around 60% of the layers of trained residual networks can be completely removed from…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Adversarial Robustness in Machine Learning · Domain Adaptation and Few-Shot Learning
