Leveraging chaotic transients in the training of artificial neural networks

Pedro Jim\'enez-Gonz\'alez; Miguel C. Soriano; Lucas Lacasa

arXiv:2506.08523·cs.LG·March 10, 2026

Leveraging chaotic transients in the training of artificial neural networks

Pedro Jim\'enez-Gonz\'alez, Miguel C. Soriano, Lucas Lacasa

PDF

Open Access

TL;DR

This paper investigates how large learning rates induce chaotic dynamics during neural network training, revealing that transient chaos can accelerate learning and improve training efficiency across various architectures and tasks.

Contribution

It demonstrates that chaotic transients at high learning rates can be harnessed to optimize neural network training, a novel insight into the role of chaos in learning dynamics.

Findings

01

Optimal training speed occurs near the onset of chaos.

02

Chaotic dynamics are observed across different architectures and tasks.

03

Transient chaos can be beneficial for faster convergence.

Abstract

Traditional algorithms to optimize artificial neural networks when confronted with a supervised learning task are usually exploitation-type relaxational dynamics such as gradient descent (GD). Here, we explore the dynamics of the neural network trajectory along training for unconventionally large learning rates. We show that for a region of values of the learning rate, the GD optimization shifts away from purely exploitation-like algorithm into a regime of exploration-exploitation balance, as the neural network is still capable of learning but the trajectory shows sensitive dependence on initial conditions --as characterized by positive network maximum Lyapunov exponent--. Interestingly, the characteristic training time required to reach an acceptable accuracy in the test set reaches a minimum precisely in such learning rate region, further suggesting that one can accelerate the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Reservoir Computing · Stochastic Gradient Optimization Techniques · Neural Networks and Applications

MethodsSparse Evolutionary Training