Flexible Vector Integration in Embedded RISC-V SoCs for End to End CNN Inference Acceleration
Dmitri Lyalikov

TL;DR
This paper presents a flexible vector integration approach in embedded RISC-V SoCs that accelerates CNN inference by optimizing hardware placement, reducing bottlenecks, and leveraging RISC-V Vector 1.0 for balanced, power-efficient execution.
Contribution
It introduces a novel system integration and compilation model utilizing RISC-V Vector 1.0 to enhance CNN inference performance on resource-constrained embedded platforms.
Findings
Up to 9x speedup in image pre-processing.
Up to 3x reduction in YOLOv3 fallback layer execution time.
Demonstrates power-efficient, balanced computation on heterogeneous SoCs.
Abstract
The emergence of heterogeneity and domain-specific architectures targeting deep learning inference show great potential for enabling the deployment of modern CNNs on resource-constrained embedded platforms. A significant development is the diversification of custom hardware solely targeting the most expensive parts of CNNs. DLAs (deep learning accelerators) and NPUs (neural processing units), among others, can overcome the approaching limits of traditional silicon scaling and provide a solution to the power/performance tradeoff within embedded SoCs. Efficient DSA utilization requires proper system integration and a compilation/execution model for balanced execution in these heterogeneous architectures. There is a critical need for proper system integration and an efficient compilation/execution model for balanced execution in these heterogeneous architectures. This work highlights the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCCD and CMOS Imaging Sensors · Integrated Circuits and Semiconductor Failure Analysis · Image Processing Techniques and Applications
