AoCStream: All-on-Chip CNN Accelerator With Stream-Based Line-Buffer Architecture
Hyeong-Ju Kang

TL;DR
AoCStream introduces a stream-based line-buffer architecture for CNN acceleration that significantly reduces memory requirements, enabling efficient on-chip implementation of object detection CNNs on low-end FPGAs without external memory.
Contribution
The paper proposes a novel stream-based line-buffer architecture and accelerator-aware pruning, reducing memory and resource usage for CNNs on low-end FPGAs, unlike traditional frame-based methods.
Findings
Achieves on-chip CNN implementation on low-end FPGA without external memory.
Higher throughput and efficiency compared to similar accelerators.
Reduces intermediate data and weight memory significantly.
Abstract
Convolutional neural network (CNN) accelerators are being widely used for their efficiency, but they require a large amount of memory, leading to the use of a slow and power consuming external memory. This paper exploits two schemes to reduce the required memory amount and ultimately to implement a CNN of reasonable performance only with on-chip memory of a practical device like a low-end FPGA. To reduce the memory amount of the intermediate data, a stream-based line-buffer architecture and a dataflow for the architecture are proposed instead of the conventional frame-based architecture, where the amount of the intermediate data memory is proportional to the square of the input image size. The architecture consists of layer-dedicated blocks operating in a pipelined way with the input and output streams. Each convolutional layer block has a line buffer storing just a few rows of input…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · CCD and CMOS Imaging Sensors · Brain Tumor Detection and Classification
