A Hardware-Friendly Algorithm for Scalable Training and Deployment of   Dimensionality Reduction Models on FPGA

Mahdi Nazemi; Amir Erfan Eshratifar; Massoud Pedram

arXiv:1801.04014·cs.LG·January 22, 2018

A Hardware-Friendly Algorithm for Scalable Training and Deployment of Dimensionality Reduction Models on FPGA

Mahdi Nazemi, Amir Erfan Eshratifar, Massoud Pedram

PDF

Open Access

TL;DR

This paper introduces a hardware-friendly, scalable algorithm for training and deploying dimensionality reduction models on FPGA, significantly reducing resource use while maintaining accuracy.

Contribution

It presents a novel algorithm and hardware implementation that enable efficient training and deployment of dimensionality reduction models on FPGA, addressing hardware training challenges.

Findings

01

Resource consumption reduced by 50%

02

No degradation in model accuracy

03

Applicable to various dimensionality reduction models

Abstract

With ever-increasing application of machine learning models in various domains such as image classification, speech recognition and synthesis, and health care, designing efficient hardware for these models has gained a lot of popularity. While the majority of researches in this area focus on efficient deployment of machine learning models (a.k.a inference), this work concentrates on challenges of training these models in hardware. In particular, this paper presents a high-performance, scalable, reconfigurable solution for both training and deployment of different dimensionality reduction models in hardware by introducing a hardware-friendly algorithm. Compared to state-of-the-art implementations, our proposed algorithm and its hardware realization decrease resource consumption by 50\% without any degradation in accuracy.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Advanced Image and Video Retrieval Techniques · Parallel Computing and Optimization Techniques