Timing Driven C-Slow Retiming on RTL for MultiCores on FPGAs
Tobias Strauch

TL;DR
This paper presents a timing driven method for implementing C-Slow Retiming directly on RTL, enabling efficient multi-core FPGA designs by leveraging existing registers and avoiding verification issues associated with netlist-level modifications.
Contribution
It introduces a novel RTL-based, timing driven approach for C-Slow Retiming, facilitating its practical application in FPGA-based multi-core systems and high-level synthesis.
Findings
Successful RTL implementation of CSR on a RISC core
Improved core functionality with minimal verification issues
Guidelines for applying CSR in high-level synthesis
Abstract
In this paper C-Slow Retiming (CSR) on RTL is discussed. CSR multiplies the functionality of cores by adding the same number of registers into each path. The technique is ideal for FPGAs with their already existing registers. Previously publications are limited to adding registers on netlist level, which generates a lot of system verification problems and which is assumed to be the major drawback to use this technology in the modern multicore times. The paper shows how CSR can efficiently be done with timing driven automatic RTL modification. The methodology provided with this paper can be used as guidance for using CSR in high level synthesis (HLS). The paper shows the results of a CSR-ed complex RISC core on RTL implemented on FPGAs.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEmbedded Systems Design Techniques · Formal Methods in Verification · Low-power high-performance VLSI design
