# ROMANet: Fine-Grained Reuse-Driven Off-Chip Memory Access Management and   Data Organization for Deep Neural Network Accelerators

**Authors:** Rachmad Vidya Wicaksana Putra, Muhammad Abdullah Hanif, Muhammad, Shafique

arXiv: 1902.10222 · 2023-03-06

## TL;DR

ROMANet is a methodology that optimizes data partitioning, scheduling, and DRAM data mapping in DNN accelerators to significantly reduce off-chip memory access energy and improve throughput.

## Contribution

It introduces a novel energy-aware data management approach that minimizes DRAM accesses and exploits DRAM features for enhanced efficiency in DNN accelerators.

## Key findings

- DRAM access energy reduced by up to 46% in tested networks.
- DRAM throughput increased by 10% with ROMANet.
- Significant energy savings and throughput improvements over state-of-the-art methods.

## Abstract

Enabling high energy efficiency is crucial for embedded implementations of deep learning. Several studies have shown that the DRAM-based off-chip memory accesses are one of the most energy-consuming operations in deep neural network (DNN) accelerators, and thereby limit the designs from achieving efficiency gains at the full potential. DRAM access energy varies depending upon the number of accesses required as well as the energy consumed per-access. Therefore, searching for a solution towards the minimum DRAM access energy is an important optimization problem. Towards this, we propose the ROMANet methodology that aims at reducing the number of memory accesses, by searching for the appropriate data partitioning and scheduling for each layer of a network using a design space exploration, based on the knowledge of the available on-chip memory and the data reuse factors. Moreover, ROMANet also targets decreasing the number of DRAM row buffer conflicts and misses, by exploiting the DRAM multi-bank burst feature to improve the energy-per-access. Besides providing the energy benefits, our proposed DRAM data mapping also results in an increased effective DRAM throughput, which is useful for latency-constraint scenarios. Our experimental results show that the ROMANet saves DRAM access energy by 12% for the AlexNet, by 36% for the VGG-16, and by 46% for the MobileNet, while also improving the DRAM throughput by 10%, as compared to the state-of-the-art.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1902.10222/full.md

## Figures

26 figures with captions in the complete paper: https://tomesphere.com/paper/1902.10222/full.md

## References

36 references — full list in the complete paper: https://tomesphere.com/paper/1902.10222/full.md

---
Source: https://tomesphere.com/paper/1902.10222