A Configurable and Efficient Memory Hierarchy for Neural Network   Hardware Accelerator

Oliver Bause; Paul Palomero Bernardo; Oliver Bringmann

arXiv:2404.15823·cs.AR·April 25, 2024·1 cites

A Configurable and Efficient Memory Hierarchy for Neural Network Hardware Accelerator

Oliver Bause, Paul Palomero Bernardo, Oliver Bringmann

PDF

Open Access

TL;DR

This paper introduces a configurable memory hierarchy framework for neural network accelerators that optimizes memory capacity and performance, reducing chip area significantly while maintaining high efficiency.

Contribution

It presents a novel, flexible memory hierarchy design with up to five levels and an optional shift register, tailored for DNN layer access patterns, improving hardware efficiency.

Findings

01

Up to 62.2% reduction in chip area.

02

Performance loss minimized to 2.4%.

03

Efficient execution of most DNN layer access patterns.

Abstract

As machine learning applications continue to evolve, the demand for efficient hardware accelerators, specifically tailored for deep neural networks (DNNs), becomes increasingly vital. In this paper, we propose a configurable memory hierarchy framework tailored for per layer adaptive memory access patterns of DNNs. The hierarchy requests data on-demand from the off-chip memory to provide it to the accelerator's compute units. The objective is to strike an optimized balance between minimizing the required memory capacity and maintaining high accelerator performance. The framework is characterized by its configurability, allowing the creation of a tailored memory hierarchy with up to five levels. Furthermore, the framework incorporates an optional shift register as final level to increase the flexibility of the memory management process. A comprehensive loop-nest analysis of DNN layers…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications · Brain Tumor Detection and Classification