Towards a tailored mixed-precision sub-8-bit quantization scheme for Gated Recurrent Units using Genetic Algorithms
Riccardo Miccini, Alessandro Cerioli, Cl\'ement Laroche, Tobias, Piechowiak, Jens Spars{\o}, Luca Pezzarossa

TL;DR
This paper introduces a genetic algorithm-based method for optimizing mixed-precision quantization of Gated Recurrent Units, achieving significant model size reduction while preserving accuracy on sequential tasks.
Contribution
It proposes a modular, per-operator bit width selection scheme for GRUs and employs genetic algorithms to optimize model size and accuracy simultaneously.
Findings
Achieves 25-55% model size reduction.
Outperforms homogeneous-precision quantization in Pareto efficiency.
Maintains accuracy comparable to 8-bit models.
Abstract
Despite the recent advances in model compression techniques for deep neural networks, deploying such models on ultra-low-power embedded devices still proves challenging. In particular, quantization schemes for Gated Recurrent Units (GRU) are difficult to tune due to their dependence on an internal state, preventing them from fully benefiting from sub-8bit quantization. In this work, we propose a modular integer quantization scheme for GRUs where the bit width of each operator can be selected independently. We then employ Genetic Algorithms (GA) to explore the vast search space of possible bit widths, simultaneously optimising for model size and accuracy. We evaluate our methods on four different sequential tasks and demonstrate that mixed-precision solutions exceed homogeneous-precision ones in terms of Pareto efficiency. In our results, we achieve a model size reduction between 25% and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAnalog and Mixed-Signal Circuit Design · Advanced Data Compression Techniques · Digital Filter Design and Implementation
