DeFiNES: Enabling Fast Exploration of the Depth-first Scheduling Space for DNN Accelerators through Analytical Modeling
Linyan Mei, Koen Goetschalckx, Arne Symons, Marian Verhelst

TL;DR
DeFiNES is a unified analytical modeling framework that enables rapid and accurate exploration of the entire depth-first scheduling space for DNN accelerators, considering detailed data access costs across memory hierarchies.
Contribution
This work introduces DeFiNES, the first comprehensive analytical model supporting both layer-by-layer and depth-first scheduling for DNN accelerators, addressing previous modeling limitations.
Findings
DeFiNES achieves up to 10X better solutions compared to state-of-the-art.
The model accurately predicts hardware costs validated on a real DNN accelerator.
DeFiNES enables efficient exploration of scheduling strategies affecting energy and latency.
Abstract
DNN workloads can be scheduled onto DNN accelerators in many different ways: from layer-by-layer scheduling to cross-layer depth-first scheduling (a.k.a. layer fusion, or cascaded execution). This results in a very broad scheduling space, with each schedule leading to varying hardware (HW) costs in terms of energy and latency. To rapidly explore this vast space for a wide variety of hardware architectures, analytical cost models are crucial to estimate scheduling effects on the HW level. However, state-of-the-art cost models are lacking support for exploring the complete depth-first scheduling space, for instance focusing only on activations while ignoring weights, or modeling only DRAM accesses while overlooking on-chip data movements. These limitations prevent researchers from systematically and accurately understanding the depth-first scheduling space. After formalizing this design…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Memory and Neural Computing · Advanced Neural Network Applications · Ferroelectric and Negative Capacitance Devices
