Combinatorial Designs for Deep Learning

Shoko Chisaki; Ryoh Fuji-Hara; Nobuko Miyamoto

arXiv:1809.08404·math.CO·May 11, 2020

Combinatorial Designs for Deep Learning

Shoko Chisaki, Ryoh Fuji-Hara, Nobuko Miyamoto

PDF

TL;DR

This paper introduces a combinatorial design approach to dropout in deep learning, aiming to balance edge frequency and reduce overfitting, by analyzing and constructing dropout node patterns inspired by experimental design.

Contribution

It proposes a novel combinatorial design method for dropout in neural networks, enhancing regularization by balancing edge sampling frequencies.

Findings

01

Designs improve dropout regularization effectiveness

02

Balanced dropout reduces overfitting

03

Constructed combinatorial patterns demonstrate theoretical and practical benefits

Abstract

Deep learning is a machine learning methodology using multi-layer neural network. A multi-layer neural network can be regarded as a chain of complete bipartite graphs. The nodes of the first partita is the input layer and the last is the output layer. The edges of a bipartite graph function as weights which are represented as a matrix. The values of i-th partita are computed by multiplication of the weight matrix and values of (i-1)-th partita. Using mass training and teacher data, the weight parameters are estimated little by little. Overfitting (or Overlearning) refers to a model that models the `training data` too well. It then becomes difficult for the model to generalize to new data which were not in the training set. The most popular method to avoid overfitting is called dropout. Dropout deletes a random sample of activations (nodes) to zero during the training process. A random…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.