FILCO: Flexible Composing Architecture with Real-Time Reconfigurability for DNN Acceleration
Xingzhen Chen, Jinming Zhuang, Zhuoping Yang, Shixin Ji, Sarah Schultz, Zheng Dong, Weisong Shi, Peipei Zhou

TL;DR
FILCO is a reconfigurable DNN accelerator architecture that dynamically adapts to diverse workloads, significantly improving throughput and efficiency compared to prior fixed or overlay architectures.
Contribution
The paper introduces FILCO, a flexible, real-time reconfigurable architecture for DNN acceleration, with an analytical framework for optimal design and demonstrated hardware efficiency gains.
Findings
Achieves 1.3x to 5x throughput improvements over prior architectures.
Supports real-time reconfiguration into unified or independent accelerators.
Validated on 7nm AMD Versal VCK190 hardware.
Abstract
With the development of deep neural network (DNN) enabled applications, achieving high hardware resource efficiency on diverse workloads is non-trivial in heterogeneous computing platforms. Prior works discuss dedicated architectures to achieve maximal resource efficiency. However, a mismatch between hardware and workloads always exists in various diverse workloads. Other works discuss overlay architecture that can dynamically switch dataflow for different workloads. However, these works are still limited by flexibility granularity and induce much resource inefficiency. To solve this problem, we propose a flexible composing architecture, FILCO, that can efficiently match diverse workloads to achieve the optimal storage and computation resource efficiency. FILCO can be reconfigured in real-time and flexibly composed into a unified or multiple independent accelerators. We also propose the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
