Description of the Training Process of Neural Networks via Ergodic Theorem : Ghost nodes

Eun-Ji Park; Sangwon Yun

arXiv:2507.01003·cs.LG·July 15, 2025

Description of the Training Process of Neural Networks via Ergodic Theorem : Ghost nodes

Eun-Ji Park, Sangwon Yun

PDF

Open Access

TL;DR

This paper introduces a novel ergodic-theoretic framework for understanding neural network training, proposing ghost nodes to enhance early training efficiency without affecting long-term performance.

Contribution

It presents a unified ergodic perspective on training dynamics and introduces ghost nodes as a new architectural extension to accelerate early training phases.

Findings

01

Ghost nodes improve early training convergence.

02

The Lyapunov exponent diagnostic distinguishes convergence from saddle points.

03

Ghost dimensions collapse after convergence, preserving original model behavior.

Abstract

Recent studies have proposed interpreting the training process from an ergodic perspective. Building on this foundation, we present a unified framework for understanding and accelerating the training of deep neural networks via stochastic gradient descent (SGD). By analyzing the geometric landscape of the objective function we introduce a practical diagnostic, the running estimate of the largest Lyapunov exponent, which provably distinguishes genuine convergence toward stable minimizers from mere statistical stabilization near saddle points. We then propose a ghost category extension for standard classifiers that adds auxiliary ghost output nodes so the model gains extra descent directions that open a lateral corridor around narrow loss barriers and enable the optimizer to bypass poor basins during the early training phase. We show that this extension strictly reduces the approximation…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStochastic Gradient Optimization Techniques · Machine Learning and ELM · Model Reduction and Neural Networks