Increasing biases can be more efficient than increasing weights

Carlo Metta; Marco Fantozzi; Andrea Papini; Gianluca Amato; Matteo; Bergamaschi; Silvia Giulia Galfr\`e; Alessandro Marchetti; Michelangelo; Vegli\`o; Maurizio Parton; Francesco Morandin

arXiv:2301.00924·cs.NE·January 19, 2024

Increasing biases can be more efficient than increasing weights

Carlo Metta, Marco Fantozzi, Andrea Papini, Gianluca Amato, Matteo, Bergamaschi, Silvia Giulia Galfr\`e, Alessandro Marchetti, Michelangelo, Vegli\`o, Maurizio Parton, Francesco Morandin

PDF

Open Access 1 Video

TL;DR

This paper proposes a new neural network unit with multiple biases, demonstrating that increasing biases can be more effective than increasing weights for improving model performance, supported by empirical and theoretical analysis.

Contribution

Introduces a novel neural network unit with multiple biases, emphasizing bias increases over weight increases for better information flow and performance.

Findings

01

Bias-focused units can outperform weight-focused ones.

02

Empirical results show improved accuracy with bias enhancements.

03

Theoretical analysis supports bias-based optimization benefits.

Abstract

We introduce a novel computational unit for neural networks that features multiple biases, challenging the traditional perceptron structure. This unit emphasizes the importance of preserving uncorrupted information as it is passed from one unit to the next, applying activation functions later in the process with specialized biases for each unit. Through both empirical and theoretical analyses, we show that by focusing on increasing biases rather than weights, there is potential for significant enhancement in a neural network model's performance. This approach offers an alternative perspective on optimizing information flow within neural networks. See source code at https://github.com/CuriosAI/dac-dev.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Increasing Biases Can Be More Efficient Than Increasing Weights· youtube

Taxonomy

TopicsNeural Networks and Applications · Advanced Memory and Neural Computing · Machine Learning and ELM

MethodsTest · Dynamic Algorithm Configuration · 1x1 Convolution · Bottleneck Residual Block · Average Pooling · Residual Connection · Max Pooling · Convolution · Batch Normalization · Global Average Pooling