Stepping Forward on the Last Mile

Chen Feng; Shaojie Zhuo; Xiaopeng Zhang; Ramchalam Kinattinkara; Ramakrishnan; Zhaocong Yuan; Andrew Zou Li

arXiv:2411.04036·cs.LG·November 7, 2024

Stepping Forward on the Last Mile

Chen Feng, Shaojie Zhuo, Xiaopeng Zhang, Ramchalam Kinattinkara, Ramakrishnan, Zhaocong Yuan, Andrew Zou Li

PDF

Open Access 1 Video

TL;DR

This paper explores the use of fixed-point forward gradients for on-device training of deep learning models on resource-constrained edge devices, demonstrating its feasibility and proposing improvements to reduce memory and accuracy gaps.

Contribution

It investigates fixed-point forward gradient training for edge devices, introduces algorithm enhancements, and provides empirical analysis of its effectiveness across vision and audio tasks.

Findings

01

Feasible on-device training with fixed-point forward gradients demonstrated.

02

Proposed algorithm enhancements reduce memory footprint and improve accuracy.

03

Empirical results show competitive performance compared to traditional backpropagation.

Abstract

Continuously adapting pre-trained models to local data on resource constrained edge devices is the $last mile$ for model deployment. However, as models increase in size and depth, backpropagation requires a large amount of memory, which becomes prohibitive for edge devices. In addition, most existing low power neural processing engines (e.g., NPUs, DSPs, MCUs, etc.) are designed as fixed-point inference accelerators, without training capabilities. Forward gradients, solely based on directional derivatives computed from two forward calls, have been recently used for model training, with substantial savings in computation and memory. However, the performance of quantized training with fixed-point forward gradients remains unclear. In this paper, we investigate the feasibility of on-device training using fixed-point forward gradients, by conducting comprehensive experiments across a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Stepping Forward on the Last Mile· slideslive

Taxonomy

TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Generative Adversarial Networks and Image Synthesis