Dense Sample Deep Learning

Stephen Jos\`e Hanson; Vivek Yadav; Catherine Hanson

arXiv:2307.10991·cs.AI·July 24, 2023

Dense Sample Deep Learning

Stephen Jos\`e Hanson, Vivek Yadav, Catherine Hanson

PDF

Open Access

TL;DR

This paper investigates the learning mechanisms of deep neural networks using a high-density sampling approach, revealing insights into feature construction and proposing a new theory of how complex features emerge.

Contribution

It introduces a novel high-density sampling task to analyze deep learning models, providing new insights into their learning dynamics and feature construction processes.

Findings

01

Visualization of feature emergence over training

02

Identification of coupling between feature detectors

03

Proposal of a new theory of complex feature construction

Abstract

Deep Learning (DL) , a variant of the neural network algorithms originally proposed in the 1980s, has made surprising progress in Artificial Intelligence (AI), ranging from language translation, protein folding, autonomous cars, and more recently human-like language models (CHATbots), all that seemed intractable until very recently. Despite the growing use of Deep Learning (DL) networks, little is actually understood about the learning mechanisms and representations that makes these networks effective across such a diverse range of applications. Part of the answer must be the huge scale of the architecture and of course the large scale of the data, since not much has changed since 1987. But the nature of deep learned representations remain largely unknown. Unfortunately training sets with millions or billions of tokens have unknown combinatorics and Networks with millions or billions of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsComputational Physics and Python Applications