Deep neural networks from the perspective of ergodic theory

Fan Zhang

arXiv:2308.03888·cs.LG·February 18, 2025

Deep neural networks from the perspective of ergodic theory

Fan Zhang

PDF

Open Access

TL;DR

This paper explores deep neural networks through the lens of ergodic theory, proposing that viewing networks as dynamical systems can clarify design heuristics and provide a more scientific understanding.

Contribution

It introduces a novel perspective by applying ergodic theory to neural networks, offering heuristic explanations for network design principles.

Findings

01

Networks can be modeled as dynamical systems with layers as time steps

02

Ergodic theory provides insights into the behavior of deep networks

03

Design heuristics can be understood as emergent properties of ergodic dynamics

Abstract

The design of deep neural networks remains somewhat of an art rather than precise science. By tentatively adopting ergodic theory considerations on top of viewing the network as the time evolution of a dynamical system, with each layer corresponding to a temporal instance, we show that some rules of thumb, which might otherwise appear mysterious, can be attributed heuristics.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsComputational Physics and Python Applications · Time Series Analysis and Forecasting · Neural Networks and Applications