FPIRM: Floating-point Processing in Racetrack Memories

S\'ebastien Ollivier; Xinyi Zhang; Yue Tang; Chayanika Choudhuri,; Jingtong Hu; and Alex K. Jones

arXiv:2204.13788·cs.ET·August 2, 2022

FPIRM: Floating-point Processing in Racetrack Memories

S\'ebastien Ollivier, Xinyi Zhang, Yue Tang, Chayanika Choudhuri,, Jingtong Hu, and Alex K. Jones

PDF

Open Access

TL;DR

FPIRM introduces a Racetrack Memory-based compute-in-memory technique that accelerates CNN inference and training at the edge, reducing energy consumption and increasing throughput compared to FPGA implementations.

Contribution

This paper presents FPIRM, a novel Racetrack Memory-based CIM approach enabling efficient multi-operand computations for CNNs, supporting both integer and floating point arithmetic.

Findings

01

FPIRM reduces energy consumption by at least 27% compared to FPGA.

02

FPIRM increases throughput by at least 18% over FPGA.

03

FPIRM achieves 2× efficiency improvement during CNN training.

Abstract

Convolutional neural networks (CNN) have become a ubiquitous algorithm with growing applications in mobile and edge settings. We describe a compute-in-memory (CIM) technique called FPIRM using Racetrack Memory (RM) to accelerate CNNs for edge systems. Using transverse read, a technique that can determine the number of '1's multiple adjacent domains, FPIRM can efficiently implement multi-operand bulk-bitwise and addition computations, and two-operand multiplication. We discuss how FPIRM can implement both variable precision integer and floating point arithmetic. This allows both CNN inference and on-device training without expensive data movement to the cloud. Based on these functions we demonstrate implementation of several CNNs with back propagation using RM CIM and compare these to state-of-the-art implementations of CIM inference and training in Field-Programmable Gate Arrays. During…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsFerroelectric and Negative Capacitance Devices · Parallel Computing and Optimization Techniques · Advanced Memory and Neural Computing