Auto-tuning of Deep Neural Networks by Conflicting Layer Removal

David Peer; Sebastian Stabinger; Antonio Rodriguez-Sanchez

arXiv:2103.04331·cs.LG·March 9, 2021·1 cites

Auto-tuning of Deep Neural Networks by Conflicting Layer Removal

David Peer, Sebastian Stabinger, Antonio Rodriguez-Sanchez

PDF

Open Access 1 Repo

TL;DR

This paper introduces a method to identify and remove conflicting layers in neural networks, leading to simpler architectures with comparable accuracy and reduced resource consumption.

Contribution

It presents a novel approach to detect conflicting layers early in training and a NAS algorithm to optimize network architecture by removing such layers.

Findings

01

Up to 60% of layers can be removed without significant accuracy loss.

02

The proposed NAS algorithm effectively identifies conflicting layers early.

03

Architectures with removed conflicting layers achieve competitive accuracy with less memory and faster inference.

Abstract

Designing neural network architectures is a challenging task and knowing which specific layers of a model must be adapted to improve the performance is almost a mystery. In this paper, we introduce a novel methodology to identify layers that decrease the test accuracy of trained models. Conflicting layers are detected as early as the beginning of training. In the worst-case scenario, we prove that such a layer could lead to a network that cannot be trained at all. A theoretical analysis is provided on what is the origin of those layers that result in a lower overall network performance, which is complemented by our extensive empirical evaluation. More precisely, we identified those layers that worsen the performance because they would produce what we name conflicting training bundles. We will show that around 60% of the layers of trained residual networks can be completely removed from…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

peerdavid/conflicting-bundles
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Adversarial Robustness in Machine Learning · Domain Adaptation and Few-Shot Learning