Neural Arithmetic Units
Andreas Madsen, Alexander Rosenberg Johansen

TL;DR
This paper introduces Neural Addition and Multiplication Units that enable neural networks to perform exact arithmetic operations, improving their ability to extrapolate and learn these functions efficiently.
Contribution
The paper presents novel neural network components, NAU and NMU, designed specifically for exact addition, subtraction, and multiplication, with insights on their initialization and regularization.
Findings
NAU and NMU converge more consistently than previous units
They require fewer parameters and learn faster
They can extrapolate to negative and small values
Abstract
Neural networks can approximate complex functions, but they struggle to perform exact arithmetic operations over real numbers. The lack of inductive bias for arithmetic operations leaves neural networks without the underlying logic necessary to extrapolate on tasks such as addition, subtraction, and multiplication. We present two new neural network components: the Neural Addition Unit (NAU), which can learn exact addition and subtraction; and the Neural Multiplication Unit (NMU) that can multiply subsets of a vector. The NMU is, to our knowledge, the first arithmetic neural network component that can learn to multiply elements from a vector, when the hidden size is large. The two new components draw inspiration from a theoretical analysis of recently proposed arithmetic components. We find that careful initialization, restricting parameter space, and regularizing for sparsity is…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Model Reduction and Neural Networks · Numerical Methods and Algorithms
