Detecting Learning vs Memorization in Deep Neural Networks using Shared   Structure Validation Sets

Elias Chaibub Neto

arXiv:1802.07714·stat.ML·February 22, 2018

Detecting Learning vs Memorization in Deep Neural Networks using Shared Structure Validation Sets

Elias Chaibub Neto

PDF

Open Access

TL;DR

This paper introduces a permutation-based method to distinguish between learning and memorization in deep neural networks by comparing feature-based classifier performance on shared-structure validation sets, revealing insights into DNN behavior.

Contribution

A novel permutation approach that separates learning from memorization in DNNs by evaluating shallow classifiers trained on features with shared-structure validation sets.

Findings

01

DNNs can still learn from Gaussian noise inputs.

02

The method corroborates previous findings on memorization patterns.

03

Insights into how DNNs memorize over training epochs.

Abstract

The roles played by learning and memorization represent an important topic in deep learning research. Recent work on this subject has shown that the optimization behavior of DNNs trained on shuffled labels is qualitatively different from DNNs trained with real labels. Here, we propose a novel permutation approach that can differentiate memorization from learning in deep neural networks (DNNs) trained as usual (i.e., using the real labels to guide the learning, rather than shuffled labels). The evaluation of weather the DNN has learned and/or memorized, happens in a separate step where we compare the predictive performance of a shallow classifier trained with the features learned by the DNN, against multiple instances of the same classifier, trained on the same input, but using shuffled labels as outputs. By evaluating these shallow classifiers in validation sets that share structure…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMusic and Audio Processing · Anomaly Detection Techniques and Applications · Machine Learning and Data Classification