Multi-Objective Linear Ensembles for Robust and Sparse Training of   Few-Bit Neural Networks

Ambrogio Maria Bernardelli; Stefano Gualandi; Hoong Chuin Lau; Simone; Milanesi; Neil Yorke-Smith

arXiv:2212.03659·math.OC·September 12, 2024

Multi-Objective Linear Ensembles for Robust and Sparse Training of Few-Bit Neural Networks

Ambrogio Maria Bernardelli, Stefano Gualandi, Hoong Chuin Lau, Simone, Milanesi, Neil Yorke-Smith

PDF

Open Access 1 Repo

TL;DR

This paper introduces a multi-objective ensemble method for training robust, sparse few-bit neural networks, significantly improving accuracy in low-data scenarios compared to existing solver and gradient-based approaches.

Contribution

It proposes a novel ensemble approach that trains a single neural network per class pair and applies majority voting, enhancing robustness and sparsity in few-bit neural networks.

Findings

01

Achieves 68.4% accuracy on MNIST with 10 images per class.

02

Reduces network size by up to 75.3% through sparsification.

03

Demonstrates robustness against input perturbations.

Abstract

Training neural networks (NNs) using combinatorial optimization solvers has gained attention in recent years. In low-data settings, state-of-the-art mixed integer linear programming solvers can train exactly a NN, avoiding intensive GPU-based training and hyper-parameter tuning and simultaneously training and sparsifying the network. We study the case of few-bit discrete-valued neural networks, both Binarized Neural Networks (BNNs), whose values are restricted to +-1, and Integer Neural Networks (INNs), whose values lie in a range {-P, ..., P}. Few-bit NNs receive increasing recognition due to their lightweight architecture and ability to run on low-power devices. This paper proposes new methods to improve the training of BNNs and INNs. Our contribution is a multi-objective ensemble approach based on training a single NN for each possible pair of classes and applying a majority voting…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

informsjoc/2023.0281
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Machine Learning and ELM · Domain Adaptation and Few-Shot Learning