Tiled Bit Networks: Sub-Bit Neural Network Compression Through Reuse of Learnable Binary Vectors
Matt Gorbett, Hossein Shirazi, Indrakshi Ray

TL;DR
This paper introduces Tiled Bit Networks, a novel quantization method that reuses learnable binary vectors to achieve sub-bit neural network compression, maintaining near full-precision performance across diverse architectures and tasks.
Contribution
It proposes a new tiling-based quantization approach for binary neural networks that significantly reduces model size while preserving accuracy.
Findings
Achieves up to 8x size reduction compared to binary models
Maintains near full-precision accuracy on various architectures and tasks
Demonstrates feasibility in resource-constrained environments
Abstract
Binary Neural Networks (BNNs) enable efficient deep learning by saving on storage and computational costs. However, as the size of neural networks continues to grow, meeting computational requirements remains a challenge. In this work, we propose a new form of quantization to tile neural network layers with sequences of bits to achieve sub-bit compression of binary-weighted neural networks. The method learns binary vectors (i.e. tiles) to populate each layer of a model via aggregation and reshaping operations. During inference, the method reuses a single tile per layer to represent the full tensor. We employ the approach to both fully-connected and convolutional layers, which make up the breadth of space in most neural architectures. Empirically, the approach achieves near fullprecision performance on a diverse range of architectures (CNNs, Transformers, MLPs) and tasks (classification,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Neural Networks and Reservoir Computing · Quantum-Dot Cellular Automata
