On "Deep Learning" Misconduct
Juyang Weng

TL;DR
This theoretical paper highlights a misconduct in deep learning where models are tested on training data, violating protocols and undermining generalization, demonstrated through a theorem involving the PGNN method.
Contribution
The paper establishes a theorem showing that common misconduct allows perfect training and validation errors, challenging the generalizability of deep learning models.
Findings
PGNN can achieve zero error on validation and test sets.
Misconduct violates transparency and cross-validation protocols.
Deep learning models may not be genuinely generalizable.
Abstract
This is a theoretical paper, as a companion paper of the plenary talk for the same conference ISAIC 2022. In contrast to the author's plenary talk in the same conference, conscious learning (Weng, 2022b; Weng, 2022c) which develops a single network for a life (many tasks), "Deep Learning" trains multiple networks for each task. Although "Deep Learning" may use different learning modes, including supervised, reinforcement and adversarial modes, almost all "Deep Learning" projects apparently suffer from the same misconduct, called "data deletion" and "test on training data". This paper establishes a theorem that a simple method called Pure-Guess Nearest Neighbor (PGNN) reaches any required errors on validation data set and test data set, including zero-error requirements, through the same misconduct, as long as the test data set is in the possession of the authors and both the amount of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications · Privacy-Preserving Technologies in Data
MethodsTest
