Low-Resolution Neural Networks

Eduardo Lobo Lustosa Cabral; Larissa Driemeier

arXiv:2502.08795·cs.LG·February 14, 2025

Low-Resolution Neural Networks

Eduardo Lobo Lustosa Cabral, Larissa Driemeier

PDF

Open Access

TL;DR

This paper investigates the effects of reducing parameter bit precision in neural networks, demonstrating that low-resolution models can maintain performance with significant memory savings, especially when using 2.32-bit weights.

Contribution

It provides a comprehensive analysis of low-resolution neural networks across various architectures, highlighting the potential for memory-efficient models with minimal performance loss.

Findings

01

Low-resolution models with 2.32-bit weights perform comparably to 32-bit models.

02

Models with fewer parameters need more epochs to reach similar accuracy.

03

Including zero as a weight value stabilizes training in low-resolution models.

Abstract

The expanding scale of large neural network models introduces significant challenges, driving efforts to reduce memory usage and enhance computational efficiency. Such measures are crucial to ensure the practical implementation and effective application of these sophisticated models across a wide array of use cases. This study examines the impact of parameter bit precision on model performance compared to standard 32-bit models, with a focus on multiclass object classification in images. The models analyzed include those with fully connected layers, convolutional layers, and transformer blocks, with model weight resolution ranging from 1 bit to 4.08 bits. The findings indicate that models with lower parameter bit precision achieve results comparable to 32-bit models, showing promise for use in memory-constrained devices. While low-resolution models with a small number of parameters…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications

MethodsFocus