GAVINA: flexible aggressive undervolting for bit-serial mixed-precision DNN acceleration

Jordi Fornt; Pau Fontova-Must\'e; Adrian Gras; Omar Lahyani; Mart\'i Caro; Jaume Abella; Francesc Moll; Josep Altet

arXiv:2511.23203·cs.AR·December 1, 2025

GAVINA: flexible aggressive undervolting for bit-serial mixed-precision DNN acceleration

Jordi Fornt, Pau Fontova-Must\'e, Adrian Gras, Omar Lahyani, Mart\'i Caro, Jaume Abella, Francesc Moll, Josep Altet

PDF

Open Access

TL;DR

GAVINA introduces a novel mixed-precision DNN accelerator that employs aggressive undervolting combined with bit-serial computation, significantly improving energy efficiency while maintaining accuracy.

Contribution

It presents GAVINA, a flexible architecture enabling aggressive undervolting and mixed-precision support, with an error model and substantial energy efficiency gains.

Findings

01

Energy efficiency up to 89 TOP/sW achieved

02

20% energy efficiency boost via undervolting

03

Negligible accuracy loss on ResNet-18

Abstract

Voltage overscaling, or undervolting, is an enticing approximate technique in the context of energy-efficient Deep Neural Network (DNN) acceleration, given the quadratic relationship between power and voltage. Nevertheless, its very high error rate has thwarted its general adoption. Moreover, recent undervolting accelerators rely on 8-bit arithmetic and cannot compete with state-of-the-art low-precision (<8b) architectures. To overcome these issues, we propose a new technique called Guarded Aggressive underVolting (GAV), which combines the ideas of undervolting and bit-serial computation to create a flexible approximation method based on aggressively lowering the supply voltage on a select number of least significant bit combinations. Based on this idea, we implement GAVINA (GAV mIxed-precisioN Accelerator), a novel architecture that supports arbitrary mixed precision and flexible…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsLow-power high-performance VLSI design · Advanced Neural Network Applications · Numerical Methods and Algorithms