MOGNET: A Mux-residual quantized Network leveraging Online-Generated weights

Van Thien Nguyen; William Guicquero; Gilles Sicard

arXiv:2501.09531·cs.LG·January 17, 2025

TL;DR

MOGNET is a compact, resource-efficient neural network architecture that combines online-generated weights with low-precision quantization to reach higher accuracy than recent methods within a tiny (sub-2Mb) memory budget.

Contribution

It introduces a novel Mux-residual quantized network with online-generated weights and a new weight ternarization method for resource-constrained hardware.
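The abstract describes the ternarization method only as "favoring the balance between quantized levels" and gives no formula here. As a rough illustration of what balanced levels can mean, the sketch below uses a simple quantile rule: thresholds are placed at the 1/3 and 2/3 empirical quantiles so that -1, 0 and +1 each receive about a third of the weights. This rule and the name `ternarize_balanced` are assumptions for illustration, not the paper's method.

```python
def ternarize_balanced(weights):
    """Map real-valued weights to {-1, 0, +1}.

    Assumption: thresholds are the 1/3 and 2/3 empirical quantiles, so the
    three levels are used about equally often. This is a stand-in rule,
    not the ternarization method introduced in the paper.
    """
    ordered = sorted(weights)
    n = len(ordered)
    low = ordered[n // 3]          # about a third of the weights lie below this
    high = ordered[(2 * n) // 3]   # about a third lie at or above this
    return [-1 if w < low else (1 if w >= high else 0) for w in weights]


# Hypothetical weights, chosen only to exercise the function.
weights = [0.9, -0.1, 0.05, -0.7, 0.3, -0.4, 0.02, 0.6, -0.25]
ternary = ternarize_balanced(weights)
counts = {level: ternary.count(level) for level in (-1, 0, 1)}
print(ternary)
print(counts)
```

A fixed magnitude threshold, by contrast, can leave one level nearly unused when the weight distribution is skewed, which is the kind of imbalance a balance-favoring rule tries to avoid.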

Findings

01. Achieves up to 1% higher accuracy than recent methods at similar or smaller model size.

02. Operates effectively within a sub-2Mb memory budget.

03. Utilizes online-generated weights and low-precision quantization for efficiency.
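To see why generating weights online helps with a tight memory budget, the sketch below counts stored weight bits for one block of the kind the abstract describes (point-wise, group-wise, point-wise), with and without storing the second point-wise convolution. All channel counts, the group count, the 2-bit weight encoding and the seed size are hypothetical values picked for illustration, and biases are ignored. It also assumes the group-wise convolution keeps the channel count unchanged.

```python
def block_storage_bits(c_in, c_mid, c_out, kernel, groups, weight_bits, seed_bits):
    """Return (stored_bits, baseline_bits) for one factorization block.

    stored_bits:   second point-wise convolution replaced by a small seed
    baseline_bits: all three convolutions stored explicitly
    """
    pw1 = c_in * c_mid                                 # first 1x1 convolution
    gw = c_mid * (c_mid // groups) * kernel * kernel   # group-wise convolution
    pw2 = c_mid * c_out                                # second 1x1 convolution
    stored = (pw1 + gw) * weight_bits + seed_bits
    baseline = (pw1 + gw + pw2) * weight_bits
    return stored, baseline


# Hypothetical block configuration (not taken from the paper).
stored, baseline = block_storage_bits(
    c_in=64, c_mid=128, c_out=64, kernel=3, groups=16,
    weight_bits=2, seed_bits=128,
)
print(stored, baseline)  # 34944 51200
```

Under these assumed sizes the block stores roughly a third fewer weight bits. The actual saving depends on the real layer shapes, which are not given on this page.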

Abstract

This paper presents a compact model architecture called MOGNET, compatible with resource-limited hardware. MOGNET uses a streamlined convolutional factorization block based on a combination of two point-wise (1x1) convolutions with a group-wise convolution in between. To further limit the overall model size and reduce the required on-chip memory, the parameters of the second point-wise convolution are generated online by a Cellular Automaton structure. In addition, MOGNET enables the use of low-precision weights and activations by taking advantage of a Multiplexer mechanism with a proper Bitshift rescaling, which integrates residual paths without increasing the hardware-related complexity. To train this model efficiently, we also introduce a novel weight ternarization method favoring the balance between quantized levels. Experimental results show that given a tiny memory budget (sub-2Mb),…
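The idea of generating a layer's weights online from a Cellular Automaton can be sketched as follows. This is a minimal illustration under stated assumptions, not the paper's design: it uses an elementary one-dimensional automaton (rule 30) with wrap-around borders, produces one row of weights per update step, and maps bits to {-1, +1}. The function names, the rule, the seed and the bit-to-weight mapping are all hypothetical.

```python
def ca_step(state, rule=30):
    """One update of an elementary cellular automaton (1-D, two states,
    radius-1 neighborhood) with wrap-around borders."""
    n = len(state)
    nxt = []
    for i in range(n):
        left, center, right = state[(i - 1) % n], state[i], state[(i + 1) % n]
        idx = (left << 2) | (center << 1) | right  # neighborhood as a 3-bit number
        nxt.append((rule >> idx) & 1)              # read that bit of the rule number
    return nxt


def generate_pointwise_weights(seed, out_channels, rule=30):
    """Expand a stored seed (one row of bits) into an
    out_channels x len(seed) matrix of {-1, +1} weights,
    one automaton step per output channel."""
    state = list(seed)
    rows = []
    for _ in range(out_channels):
        state = ca_step(state, rule)
        rows.append([2 * b - 1 for b in state])    # bit 0 -> -1, bit 1 -> +1
    return rows


# Hypothetical 8-bit seed: only these bits would need to be stored.
seed = [0, 0, 0, 1, 0, 0, 0, 0]
weights = generate_pointwise_weights(seed, out_channels=4)
for row in weights:
    print(row)
```

Because the update is deterministic, the same seed always regenerates the same weight matrix, so only the seed and the rule have to be kept in memory rather than the full 1x1 convolution kernel.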

Taxonomy

Methods: Convolution