One Forward is Enough for Neural Network Training via Likelihood Ratio   Method

Jinyang Jiang; Zeliang Zhang; Chenliang Xu; Zhaofei Yu; Yijie Peng

arXiv:2305.08960·cs.LG·October 16, 2023·2 cites

One Forward is Enough for Neural Network Training via Likelihood Ratio Method

Jinyang Jiang, Zeliang Zhang, Chenliang Xu, Zhaofei Yu, Yijie Peng

PDF

Open Access 1 Video

TL;DR

This paper introduces a likelihood ratio method for neural network training that requires only one forward pass, offering greater flexibility and efficiency compared to traditional backpropagation.

Contribution

The authors propose a unified likelihood ratio approach for gradient estimation, eliminating recursive backpropagation and enabling flexible architecture design and device adaptation.

Findings

01

ULR achieves effective training with a single forward pass.

02

The method improves training flexibility and robustness.

03

Variance reduction techniques accelerate convergence.

Abstract

While backpropagation (BP) is the mainstream approach for gradient computation in neural network training, its heavy reliance on the chain rule of differentiation constrains the designing flexibility of network architecture and training pipelines. We avoid the recursive computation in BP and develop a unified likelihood ratio (ULR) method for gradient estimation with just one forward propagation. Not only can ULR be extended to train a wide variety of neural network architectures, but the computation flow in BP can also be rearranged by ULR for better device adaptation. Moreover, we propose several variance reduction techniques to further accelerate the training process. Our experiments offer numerical results across diverse aspects, including various neural network training scenarios, computation flow rearrangement, and fine-tuning of pre-trained models. All findings demonstrate that…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

One Forward is Enough for Neural Network Training via Likelihood Ratio Method· slideslive

Taxonomy

TopicsMachine Learning and ELM · Neural Networks and Applications · Advanced Neural Network Applications