Dropout Neural Network Training Viewed from a Percolation Perspective

Finley Devlin; Jaron Sanders

arXiv:2512.13853·cs.LG·December 17, 2025

Dropout Neural Network Training Viewed from a Percolation Perspective

Finley Devlin, Jaron Sanders

PDF

Open Access

TL;DR

This paper explores the percolation phenomenon in dropout training of neural networks, revealing how random connection removal affects network connectivity and can cause training breakdowns, especially in bias-free models.

Contribution

It introduces percolation models for dropout, characterizes the relationship between network topology and connectivity, and demonstrates the percolative effect's impact on training stability.

Findings

01

Percolation effects exist in dropout training.

02

Dropout can cause network disconnection and training breakdown.

03

Breakdowns are more severe in bias-free neural networks.

Abstract

In this work, we investigate the existence and effect of percolation in training deep Neural Networks (NNs) with dropout. Dropout methods are regularisation techniques for training NNs, first introduced by G. Hinton et al. (2012). These methods temporarily remove connections in the NN, randomly at each stage of training, and update the remaining subnetwork with Stochastic Gradient Descent (SGD). The process of removing connections from a network at random is similar to percolation, a paradigm model of statistical physics. If dropout were to remove enough connections such that there is no path between the input and output of the NN, then the NN could not make predictions informed by the data. We study new percolation models that mimic dropout in NNs and characterise the relationship between network topology and this path problem. The theory shows the existence of a percolative effect…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStochastic Gradient Optimization Techniques · Adversarial Robustness in Machine Learning · Machine Learning and ELM