FPGA Processor In Memory Architectures (PIMs): Overlay or Overhaul ?
MD Arafat Kabir, Ehsan Kabir, Joshua Hollis, Eli Levy-Mackay,, Atiyehsadat Panahi, Jason Bakos, Miaoqing Huang, David Andrews

TL;DR
This paper compares FPGA-based PIM architectures and overlays, demonstrating that a PIM overlay can achieve high throughput and efficiency, and can enhance custom PIM designs.
Contribution
It provides a comparative analysis of custom FPGA PIM architectures and a novel PIM overlay, introducing PiCaSO and showing how overlay features improve custom designs.
Findings
PiCaSO achieves 80% of peak throughput of custom designs
PiCaSO has 2.56x shorter latency than custom architectures
Overlay features improve custom PIM throughput by 18%, latency by 19.5%, and efficiency by 6.2%
Abstract
The dominance of machine learning and the ending of Moore's law have renewed interests in Processor in Memory (PIM) architectures. This interest has produced several recent proposals to modify an FPGA's BRAM architecture to form a next-generation PIM reconfigurable fabric. PIM architectures can also be realized within today's FPGAs as overlays without the need to modify the underlying FPGA architecture. To date, there has been no study to understand the comparative advantages of the two approaches. In this paper, we present a study that explores the comparative advantages between two proposed custom architectures and a PIM overlay running on a commodity FPGA. We created PiCaSO, a Processor in/near Memory Scalable and Fast Overlay architecture as a representative PIM overlay. The results of this study show that the PiCaSO overlay achieves up to 80% of the peak throughput of the custom…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsParallel Computing and Optimization Techniques · Advanced Data Storage Technologies · Advanced Memory and Neural Computing
