Dense Sample Deep Learning
Stephen Jos\`e Hanson, Vivek Yadav, Catherine Hanson

TL;DR
This paper investigates the learning mechanisms of deep neural networks using a high-density sampling approach, revealing insights into feature construction and proposing a new theory of how complex features emerge.
Contribution
It introduces a novel high-density sampling task to analyze deep learning models, providing new insights into their learning dynamics and feature construction processes.
Findings
Visualization of feature emergence over training
Identification of coupling between feature detectors
Proposal of a new theory of complex feature construction
Abstract
Deep Learning (DL) , a variant of the neural network algorithms originally proposed in the 1980s, has made surprising progress in Artificial Intelligence (AI), ranging from language translation, protein folding, autonomous cars, and more recently human-like language models (CHATbots), all that seemed intractable until very recently. Despite the growing use of Deep Learning (DL) networks, little is actually understood about the learning mechanisms and representations that makes these networks effective across such a diverse range of applications. Part of the answer must be the huge scale of the architecture and of course the large scale of the data, since not much has changed since 1987. But the nature of deep learned representations remain largely unknown. Unfortunately training sets with millions or billions of tokens have unknown combinatorics and Networks with millions or billions of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComputational Physics and Python Applications
