Combinatorial Designs for Deep Learning
Shoko Chisaki, Ryoh Fuji-Hara, Nobuko Miyamoto

TL;DR
This paper introduces a combinatorial design approach to dropout in deep learning, aiming to balance edge frequency and reduce overfitting, by analyzing and constructing dropout node patterns inspired by experimental design.
Contribution
It proposes a novel combinatorial design method for dropout in neural networks, enhancing regularization by balancing edge sampling frequencies.
Findings
Designs improve dropout regularization effectiveness
Balanced dropout reduces overfitting
Constructed combinatorial patterns demonstrate theoretical and practical benefits
Abstract
Deep learning is a machine learning methodology using multi-layer neural network. A multi-layer neural network can be regarded as a chain of complete bipartite graphs. The nodes of the first partita is the input layer and the last is the output layer. The edges of a bipartite graph function as weights which are represented as a matrix. The values of i-th partita are computed by multiplication of the weight matrix and values of (i-1)-th partita. Using mass training and teacher data, the weight parameters are estimated little by little. Overfitting (or Overlearning) refers to a model that models the `training data` too well. It then becomes difficult for the model to generalize to new data which were not in the training set. The most popular method to avoid overfitting is called dropout. Dropout deletes a random sample of activations (nodes) to zero during the training process. A random…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
