Towards Error Correction for Computing in Racetrack Memory
Preston Brazzle, Benjamin F. Morris III, Evan McKinney, Peipei Zhou,, Jingtong Hu, Asif Ali Khan, and Alex K. Jones

TL;DR
This paper introduces CIRM-ECC, a novel error correction technique designed to protect Racetrack Memory-based computing-in-memory systems from faults, leveraging the memory's peripheral circuitry to implement ECC codes.
Contribution
It presents a new ECC method tailored for Racetrack Memory CIM systems, capable of protecting against faults using existing Hamming and BCH codes.
Findings
CIRM-ECC effectively corrects CIM faults in Racetrack Memory.
The approach integrates with existing ECC codes like Hamming and BCH.
Demonstrates improved fault tolerance in Racetrack Memory CIM architectures.
Abstract
Computing-in-memory (CIM) promises to alleviate the Von Neumann bottleneck and accelerate data-intensive applications. Depending on the underlying technology and configuration, CIM enables implementing compute primitives in place, such as multiplication, search operations, and bulk bitwise logic operations. Emerging nonvolatile memory technologies such as spintronic Racetrack memory (RTM) promise not only unprecedented density but also significant parallelism through CIM. However, most CIM designs, including those based on RTM, exhibit high fault rates. Existing error correction codes (ECC) are not homomorphic over bitwise operations such as AND and OR, and hence cannot protect against CIM faults. This paper proposes CIRM-ECC, a technique to protect spintronic RTMs against CIM faults. At the core of CIRM-ECC, we use a recently proposed RTM-based CIM approach and leverage its peripheral…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsParallel Computing and Optimization Techniques · Distributed systems and fault tolerance · Graph Theory and Algorithms
