Design and Implementation of BNN-Based Object Detection on FPGA
Xuyu Zhao, Yunpeng Wu, Mengyuan Zhu, Haoyu Huang, Xiaoyu Xu, Yanjing Li, Gaolong Zhang, Baochang Zhang

TL;DR
This paper presents a low-cost FPGA implementation of a BNN-based object detector inspired by YOLOv3-tiny, achieving efficient detection with high accuracy and low resource usage.
Contribution
It introduces a complete hardware design for a BNN-based object detector on FPGA, including model conversion, fixed-point implementation, and optimized RTL architecture.
Findings
Achieves 39.6% mAP50 on VOC dataset
Uses only 0.098 GFLOPs and 0.74 million parameters
RTL simulation shows high correlation with ONNX model outputs
Abstract
This paper implements a Binary Neural Network (BNN) based YOLOv3-tiny-like object detector on a low-cost FPGA. The network takes 320*320*3 RGB images as input. Its main convolution layers use 1-bit weights and 8-bit activations, while Conv1 and the final detection head use fixed-point standard convolutions. From the trained ONNX model, weights, biases, and quantization parameters are extracted, converted to fixed point, packed into COE files, and stored in Vivado BRAM ROMs. The hardware is written fully in Verilog RTL and includes padding, line buffering, binary convolution, quantization post-processing, max pooling, and detection-head computation. For layers where Mul_prev is indexed by input channel and Div_current by output channel, Mul_prev is fused in-to the BNN PE so that channel-wise compensation is applied during accumulation. On VOC, the model obtains 39.6% mAP50 with 0.098…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
