Using System Hyper Pipelining (SHP) to Improve the Performance of a   Coarse-Grained Reconfigurable Architecture (CGRA) Mapped on an FPGA

Tobias Strauch

arXiv:1508.07139·cs.AR·August 31, 2015·1 cites

Using System Hyper Pipelining (SHP) to Improve the Performance of a Coarse-Grained Reconfigurable Architecture (CGRA) Mapped on an FPGA

Tobias Strauch

PDF

Open Access

TL;DR

This paper introduces System Hyper Pipelining (SHP) applied to CGRA PEs on FPGA, enabling dynamic threading and improved performance through flexible thread management and reduced data traffic.

Contribution

It presents a novel application of SHP to CGRA PEs, enhancing performance and flexibility in FPGA-based architectures.

Findings

01

Increased performance per PE using SHP

02

Reduced data traffic on CGRA routing structure

03

Effective implementation of Fork-Join operations

Abstract

The well known method C-Slow Retiming (CSR) can be used to automatically convert a given CPU into a multithreaded CPU with independent threads. These CPUs are then called streaming or barrel processors. System Hyper Pipelining (SHP) adds a new flexibility on top of CSR by allowing a dynamic number of threads to be executed and by enabling the threads to be stalled, bypassed and reordered. SHP is now applied on the programming elements (PE) of a coarse-grained reconfigurable architecture (CGRA). By using SHP, more performance can be achieved per PE. Fork-Join operations can be implemented on a PE using the flexibility provided by SHP to dynamically adjust the number of threads per PE. Multiple threads can share the same data locally, which greatly reduces the data traffic load on the CGRA's routing structure. The paper shows the results of a CGRA using SHP-ed RISC-V cores as PEs…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsInterconnection Networks and Systems · Embedded Systems Design Techniques · VLSI and FPGA Design Techniques