Achieving Efficient Realization of Kalman Filter on CGRA through Algorithm-Architecture Co-design
Farhad Merchant, Tarun Vatwani, Anupam Chattopadhyay, Soumyendu Raha,, S K Nandy, Ranjani Narayan

TL;DR
This paper demonstrates a highly efficient implementation of the Kalman Filter on CGRA hardware, achieving significant performance and energy efficiency improvements through algorithm-architecture co-design and optimization.
Contribution
It introduces a novel co-designed approach using MFA and CGRA to realize KF with high efficiency, scalability, and performance gains.
Findings
Achieved up to 65% of the platform's peak performance.
Realized a 116% Gflops improvement over initial implementations.
Attained 4-105x better Gflops/watt compared to existing solutions.
Abstract
In this paper, we present efficient realization of Kalman Filter (KF) that can achieve up to 65% of the theoretical peak performance of underlying architecture platform. KF is realized using Modified Faddeeva Algorithm (MFA) as a basic building block due to its versatility and REDEFINE Coarse Grained Reconfigurable Architecture (CGRA) is used as a platform for experiments since REDEFINE is capable of supporting realization of a set algorithmic compute structures at run-time on a Reconfigurable Data-path (RDP). We perform several hardware and software based optimizations in the realization of KF to achieve 116% improvement in terms of Gflops over the first realization of KF. Overall, with the presented approach for KF, 4-105x performance improvement in terms of Gflops/watt over several academically and commercially available realizations of KF is attained. In REDEFINE, we show that our…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
