Neural Arithmetic Units

Andreas Madsen; Alexander Rosenberg Johansen

arXiv:2001.05016·cs.NE·January 16, 2020·20 cites

Neural Arithmetic Units

Andreas Madsen, Alexander Rosenberg Johansen

PDF

Open Access 3 Repos

TL;DR

This paper introduces Neural Addition and Multiplication Units that enable neural networks to perform exact arithmetic operations, improving their ability to extrapolate and learn these functions efficiently.

Contribution

The paper presents novel neural network components, NAU and NMU, designed specifically for exact addition, subtraction, and multiplication, with insights on their initialization and regularization.

Findings

01

NAU and NMU converge more consistently than previous units

02

They require fewer parameters and learn faster

03

They can extrapolate to negative and small values

Abstract

Neural networks can approximate complex functions, but they struggle to perform exact arithmetic operations over real numbers. The lack of inductive bias for arithmetic operations leaves neural networks without the underlying logic necessary to extrapolate on tasks such as addition, subtraction, and multiplication. We present two new neural network components: the Neural Addition Unit (NAU), which can learn exact addition and subtraction; and the Neural Multiplication Unit (NMU) that can multiply subsets of a vector. The NMU is, to our knowledge, the first arithmetic neural network component that can learn to multiply elements from a vector, when the hidden size is large. The two new components draw inspiration from a theoretical analysis of recently proposed arithmetic components. We find that careful initialization, restricting parameter space, and regularizing for sparsity is…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications · Model Reduction and Neural Networks · Numerical Methods and Algorithms