The mechanism underlying successful deep learning
Yarden Tzach, Yuval Meir, Ofek Tevet, Ronit D. Gross, Shiri Hodassman,, Roni Vardi, Ido Kanter

TL;DR
This paper introduces a three-phase method to analyze and quantify the underlying mechanism of successful deep learning, revealing how filters contribute to classification success.
Contribution
It presents a novel procedure to dissect deep architectures, quantifying filter functions and their role in improving success rates and feature extraction.
Findings
Filters tend to select single output labels, reducing noise and sharpening features.
Layer-wise sharpening of features enhances success rates and signal-to-noise ratios.
The method effectively analyzes architectures like VGG-16, VGG-6, and AVGG-16.
Abstract
Deep architectures consist of tens or hundreds of convolutional layers (CLs) that terminate with a few fully connected (FC) layers and an output layer representing the possible labels of a complex classification task. According to the existing deep learning (DL) rationale, the first CL reveals localized features from the raw data, whereas the subsequent layers progressively extract higher-level features required for refined classification. This article presents an efficient three-phase procedure for quantifying the mechanism underlying successful DL. First, a deep architecture is trained to maximize the success rate (SR). Next, the weights of the first several CLs are fixed and only the concatenated new FC layer connected to the output is trained, resulting in SRs that progress with the layers. Finally, the trained FC weights are silenced, except for those emerging from a single filter,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Industrial Vision Systems and Defect Detection · Machine Learning and ELM
