Multi-Objective Linear Ensembles for Robust and Sparse Training of Few-Bit Neural Networks
Ambrogio Maria Bernardelli, Stefano Gualandi, Hoong Chuin Lau, Simone, Milanesi, Neil Yorke-Smith

TL;DR
This paper introduces a multi-objective ensemble method for training robust, sparse few-bit neural networks, significantly improving accuracy in low-data scenarios compared to existing solver and gradient-based approaches.
Contribution
It proposes a novel ensemble approach that trains a single neural network per class pair and applies majority voting, enhancing robustness and sparsity in few-bit neural networks.
Findings
Achieves 68.4% accuracy on MNIST with 10 images per class.
Reduces network size by up to 75.3% through sparsification.
Demonstrates robustness against input perturbations.
Abstract
Training neural networks (NNs) using combinatorial optimization solvers has gained attention in recent years. In low-data settings, state-of-the-art mixed integer linear programming solvers can train exactly a NN, avoiding intensive GPU-based training and hyper-parameter tuning and simultaneously training and sparsifying the network. We study the case of few-bit discrete-valued neural networks, both Binarized Neural Networks (BNNs), whose values are restricted to +-1, and Integer Neural Networks (INNs), whose values lie in a range {-P, ..., P}. Few-bit NNs receive increasing recognition due to their lightweight architecture and ability to run on low-power devices. This paper proposes new methods to improve the training of BNNs and INNs. Our contribution is a multi-objective ensemble approach based on training a single NN for each possible pair of classes and applying a majority voting…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Machine Learning and ELM · Domain Adaptation and Few-Shot Learning
