TL;DR
This paper introduces a modified version of the Caffe framework that supports FPGA implementations of CNNs, enabling flexible, reprogrammable, and efficient neural network processing on FPGA hardware.
Contribution
It presents an FPGA-compatible extension of Caffe, including a Winograd convolution engine, facilitating CNN deployment on FPGA with seamless integration and high performance.
Findings
Achieves 50 GFLOPS on 3x3 convolutions
Supports multiple CNN architectures like AlexNet and VGG
Enables flexible FPGA-based CNN deployment
Abstract
Convolutional Neural Networks (CNNs) have gained significant traction in the field of machine learning, particularly due to their high accuracy in visual recognition. Recent works have pushed the performance of GPU implementations of CNNs to significantly improve their classification and training times. With these improvements, many frameworks have become available for implementing CNNs on both CPUs and GPUs, with no support for FPGA implementations. In this work we present a modified version of the popular CNN framework Caffe, with FPGA support. This allows for classification using CNN models and specialized FPGA implementations with the flexibility of reprogramming the device when necessary, seamless memory transactions between host and device, simple-to-use test benches, and the ability to create pipelined layer implementations. To validate the framework, we use the Xilinx SDAccel…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsDropout · Dense Connections · *Communicated@Fast*How Do I Communicate to Expedia? · Max Pooling · Softmax · Ethereum Customer Service Number +1-833-534-1729 · Convolution
