# 3D-aCortex: An Ultra-Compact Energy-Efficient Neurocomputing Platform Based on Commercial 3D-NAND Flash Memories

**Authors:** Mohammad Bavandpour, Shubham Sahay, Mohammad Reza Mahmoodi, Dmitri B. Strukov

arXiv: 1908.02472 · 2019-08-08

## TL;DR

This paper introduces 3D-aCortex, an energy-efficient neuromorphic platform using commercial 3D-NAND flash memories for dense vector-matrix multiplication, achieving record-breaking efficiency and throughput in neural inference tasks.

## Contribution

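The paper develops a time-domain-encoded vector-by-matrix-multiplication (VMM) circuit that runs on unmodified commercial 3D-NAND flash memory blocks, and builds 3D-aCortex, a multi-purpose neuromorphic inference processor, around these VMM blocks. Circuit- and system-level simulations that account for device non-idealities are used to estimate its density, energy efficiency, and throughput.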

## Key findings

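- VMM block: area efficiency of 0.14 μm²/byte for a 5-bit VMM of 200-element vectors, including peripheral circuitry overheads
- VMM block: energy efficiency of ~10 fJ/Op for the same configuration, including input/output and other peripheral circuitry
- 3D-aCortex processor: storage efficiency of 4.34 MB/mm², peak energy efficiency of 70.43 TOps/J, and computational throughput of up to 10.66 TOps/s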
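
To put the reported figures on a common scale, the following is a minimal unit-conversion sketch. It uses only numbers quoted on this page (including the 70.43 TOps/J peak energy efficiency from the abstract). The function names are illustrative, 1 MB is assumed to be 10^6 bytes, and the implied-power line assumes that the peak throughput and peak energy efficiency occur at the same operating point, which the source does not state.

```python
# Back-of-the-envelope unit conversions for the reported figures.
# All helper names are illustrative, not taken from the paper.

UM2_PER_MM2 = 1e6  # 1 mm^2 = 10^6 um^2


def bytes_per_mm2(area_um2_per_byte):
    """Convert an area cost (um^2 per byte) into a density (bytes per mm^2)."""
    return UM2_PER_MM2 / area_um2_per_byte


def tops_per_joule(energy_fj_per_op):
    """Convert an energy cost (fJ per Op) into an efficiency (TOps per J)."""
    ops_per_joule = 1e15 / energy_fj_per_op  # 1 fJ = 1e-15 J
    return ops_per_joule / 1e12              # 1 TOps = 1e12 Ops


# VMM block figures: 0.14 um^2/byte and ~10 fJ/Op.
block_density_mb = bytes_per_mm2(0.14) / 1e6  # MB/mm^2, ASSUMING 1 MB = 10^6 bytes
block_efficiency = tops_per_joule(10.0)       # TOps/J

# Processor-level figures: 10.66 TOps/s and 70.43 TOps/J.
# ASSUMPTION: both peaks are reached at the same operating point.
implied_power_w = 10.66 / 70.43               # (TOps/s) / (TOps/J) = W

print(f"block density    ~ {block_density_mb:.2f} MB/mm^2")
print(f"block efficiency ~ {block_efficiency:.0f} TOps/J")
print(f"implied power    ~ {implied_power_w:.3f} W")
```

The block-level and processor-level numbers come from different simulation scopes, so these conversions are meant for orientation rather than as a direct comparison.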

## Abstract

The first contribution of this paper is the development of extremely dense, energy-efficient mixed-signal vector-by-matrix-multiplication (VMM) circuits based on the existing 3D-NAND flash memory blocks, without any need for their modification. Such compatibility is achieved using time-domain-encoded VMM design. Our detailed simulations have shown that, for example, the 5-bit VMM of 200-element vectors, using the commercially available 64-layer gate-all-around macaroni-type 3D-NAND memory blocks designed in the 55-nm technology node, may provide an unprecedented area efficiency of 0.14 μm²/byte and energy efficiency of ~10 fJ/Op, including the input/output and other peripheral circuitry overheads. Our second major contribution is the development of 3D-aCortex, a multi-purpose neuromorphic inference processor that utilizes the proposed 3D-VMM blocks as its core processing units. We have performed rigorous performance simulations of such a processor on both circuit and system levels, taking into account non-idealities such as drain-induced barrier lowering, capacitive coupling, charge injection, parasitics, process variations, and noise. Our modeling of the 3D-aCortex performing several state-of-the-art neuromorphic-network benchmarks has shown that it may provide the record-breaking storage efficiency of 4.34 MB/mm², the peak energy efficiency of 70.43 TOps/J, and the computational throughput up to 10.66 TOps/s. The storage efficiency can be further improved seven-fold by aggressively sharing VMM peripheral circuits at the cost of a slight decrease in energy efficiency and throughput.
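
The abstract names time-domain encoding but does not describe the circuit. As a rough intuition only, the sketch below shows one generic way a VMM can be expressed in the time domain: each input becomes a pulse duration, each weight becomes a cell current, and each output is the charge accumulated on a column. Every modeling choice here (the function name `time_domain_vmm`, the `t_max` and `i_max` parameters, input quantization to 5 bits, and the complete absence of non-idealities) is an assumption made for illustration and is not taken from the paper.

```python
def time_domain_vmm(inputs, weights, bits=5, t_max=1.0, i_max=1.0):
    """Idealized time-domain VMM: output charge Q_j = sum_i I_ij * t_i.

    ASSUMPTIONS (not from the paper): inputs and weights are normalized
    to [0, 1]; inputs are quantized to `bits` bits and encoded as pulse
    durations; weights are encoded as cell currents; no noise, coupling,
    or other device non-idealities are modeled.
    """
    levels = 2 ** bits - 1
    # Encode each input as a quantized pulse duration in [0, t_max].
    durations = [round(x * levels) / levels * t_max for x in inputs]

    n_outputs = len(weights[0])
    charges = []
    for j in range(n_outputs):
        # Each cell contributes current I_ij for duration t_i;
        # the column integrates the total charge.
        q = sum(weights[i][j] * i_max * durations[i]
                for i in range(len(inputs)))
        charges.append(q)
    return charges


# Hypothetical 2-input, 2-output example.
example_inputs = [1.0, 0.0]
example_weights = [[0.5, 1.0],
                   [0.25, 0.75]]
example_charges = time_domain_vmm(example_inputs, example_weights)
print(example_charges)
```

In this toy model only the first input has a nonzero pulse duration, so each output charge equals the first row of the weight matrix. A realistic model would also need the non-idealities the abstract lists, such as drain-induced barrier lowering, capacitive coupling, and noise.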

---
Source: https://tomesphere.com/paper/1908.02472