ReLU activated Multi-Layer Neural Networks trained with Mixed Integer   Linear Programs

Steffen Goebbels

arXiv:2008.08386·cs.LG·April 12, 2021

ReLU activated Multi-Layer Neural Networks trained with Mixed Integer Linear Programs

Steffen Goebbels

PDF

Open Access

TL;DR

This paper demonstrates that ReLU-activated multi-layer neural networks can be trained using Mixed Integer Linear Programs, achieving comparable accuracy to traditional methods on MNIST.

Contribution

It introduces a novel MILP-based training method for neural networks, providing an alternative to gradient-based optimization.

Findings

01

Achieved MNIST accuracy comparable to TensorFlow/Keras.

02

Validated MILP-based training on simple neural networks.

03

Showed iterative layer-wise weight adjustment using MILPs.

Abstract

In this paper, it is demonstrated through a case study that multilayer feedforward neural networks activated by ReLU functions can in principle be trained iteratively with Mixed Integer Linear Programs (MILPs) as follows. Weights are determined with batch learning. Multiple iterations are used per batch of training data. In each iteration, the algorithm starts at the output layer and propagates information back to the first hidden layer to adjust the weights using MILPs or Linear Programs. For each layer, the goal is to minimize the difference between its output and the corresponding target output. The target output of the last (output) layer is equal to the ground truth. The target output of a previous layer is defined as the adjusted input of the following layer. For a given layer, weights are computed by solving a MILP. Then, except for the first hidden layer, the input values are…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Neural Networks and Applications · Machine Learning and ELM

Methods*Communicated@Fast*How Do I Communicate to Expedia?