HEP-BNN: A Framework for Finding Low-Latency Execution Configurations of BNNs on Heterogeneous Multiprocessor Platforms
Leonard David Bereholschi, Ching-Chi Lin, Mikail Yayla, Jian-Jia Chen

TL;DR
This paper introduces HEP-BNN, a framework that optimizes the execution of Binarized Neural Networks on heterogeneous CPU-GPU platforms by finding low-latency layer-to-device mappings, significantly improving inference speed.
Contribution
It presents a novel framework for efficient BNN workload mapping on heterogeneous platforms, optimizing performance beyond fully-parallelized GPU implementations.
Findings
Achieves up to 11.8x faster inference on tested hardware.
Effectively balances workload between CPU and GPU.
Demonstrates improvements on Fashion-MNIST and CIFAR-10 datasets.
Abstract
Binarized Neural Networks (BNNs) significantly reduce the computation and memory demands with binarized weights and activations compared to full-precision NNs. Executing a layer in a BNN on different devices of a heterogeneous multiprocessor platform consisting of CPU and GPU can affect the inference performance, i.e., accuracy and latency. Usually, a heterogeneous HW platform consisting of a CPU and a GPU is available to execute the BNN workloads. However, to use the heterogeneous HW effectively, it is necessary to find an efficient strategy for BNN workload mapping. In this work, we propose a framework that generates efficient BNN layer-to-device mappings (i.e. suitable parallel configuration for each layer of the model) for execution platforms comprised of CPU and CUDA-capable GPU. We evaluate our proposed framework with two BNN architectures using two well-known datasets,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Brain Tumor Detection and Classification · Machine Learning in Materials Science
