NeuroBlend: Towards Low-Power yet Accurate Neural Network-Based Inference Engine Blending Binary and Fixed-Point Convolutions
Arash Fayyazi, Mahdi Nazemi, Arya Fayyazi, Massoud Pedram

TL;DR
NeuroBlend introduces a hybrid neural network architecture combining binary and fixed-point convolutions, optimized for FPGA deployment to achieve high accuracy, low latency, and reduced power consumption on image classification tasks.
Contribution
This work presents NeuroBlend, a novel architecture with a unique Blend module, and a hardware compiler for FPGA mapping, achieving improved accuracy and efficiency over prior binary neural networks.
Findings
Achieves 88.0 ext% accuracy on CIFAR-10 with faster inference
Reduces model size by 3.5x compared to full-precision models
Lowers power consumption by 2.5x using DSP-based logic operations
Abstract
This paper introduces NeuroBlend, a novel neural network architecture featuring a unique building block known as the Blend module. This module incorporates binary and fixed-point convolutions in its main and skip paths, respectively. There is a judicious deployment of batch normalizations on both main and skip paths inside the Blend module and in between consecutive Blend modules. Additionally, we present a compiler and hardware architecture designed to map NeuroBlend models onto FPGA devices, aiming to minimize inference latency while maintaining high accuracy. Our NeuroBlend-20 (NeuroBlend-18) model, derived from ResNet-20 (ResNet-18) trained on CIFAR-10 (CIFAR-100), achieves 88.0\% (73.73\%) classification accuracy, outperforming state-of-the-art binary neural networks by 0.8\% (1.33\%), with an inference time of 0.38ms per image, 1.4x faster than previous FPGA implementation for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · CCD and CMOS Imaging Sensors · Machine Learning and ELM
