A Hardware-Friendly Algorithm for Scalable Training and Deployment of Dimensionality Reduction Models on FPGA
Mahdi Nazemi, Amir Erfan Eshratifar, Massoud Pedram

TL;DR
This paper introduces a hardware-friendly, scalable algorithm for training and deploying dimensionality reduction models on FPGA, significantly reducing resource use while maintaining accuracy.
Contribution
It presents a novel algorithm and hardware implementation that enable efficient training and deployment of dimensionality reduction models on FPGA, addressing hardware training challenges.
Findings
Resource consumption reduced by 50%
No degradation in model accuracy
Applicable to various dimensionality reduction models
Abstract
With ever-increasing application of machine learning models in various domains such as image classification, speech recognition and synthesis, and health care, designing efficient hardware for these models has gained a lot of popularity. While the majority of researches in this area focus on efficient deployment of machine learning models (a.k.a inference), this work concentrates on challenges of training these models in hardware. In particular, this paper presents a high-performance, scalable, reconfigurable solution for both training and deployment of different dimensionality reduction models in hardware by introducing a hardware-friendly algorithm. Compared to state-of-the-art implementations, our proposed algorithm and its hardware realization decrease resource consumption by 50\% without any degradation in accuracy.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Advanced Image and Video Retrieval Techniques · Parallel Computing and Optimization Techniques
