# Statistical Mechanics of Node-perturbation Learning with Noisy Baseline

**Authors:** Kazuyuki Hara, Kentaro Katahira, and Masato Okada

arXiv: 1706.06953 · 2017-06-22

## TL;DR

This paper develops a statistical mechanics framework for node-perturbation learning with noisy baselines, deriving differential equations to analyze learning dynamics and generalization performance.

## Contribution

It extends the theoretical understanding of node-perturbation learning with noisy baselines by deriving coupled differential equations for learning dynamics.

## Key findings

- Derived differential equations for learning dynamics.
- Showed how to compute generalization error from the equations.
- Validated the applicability of Cho's results in general cases.

## Abstract

Node-perturbation learning is a type of statistical gradient descent algorithm that can be applied to problems where the objective function is not explicitly formulated, including reinforcement learning. It estimates the gradient of an objective function by using the change in the object function in response to the perturbation. The value of the objective function for an unperturbed output is called a baseline. Cho et al. proposed node-perturbation learning with a noisy baseline. In this paper, we report on building the statistical mechanics of Cho's model and on deriving coupled differential equations of order parameters that depict learning dynamics. We also show how to derive the generalization error by solving the differential equations of order parameters. On the basis of the results, we show that Cho's results are also apply in general cases and show some general performances of Cho's model.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1706.06953/full.md

## Figures

11 figures with captions in the complete paper: https://tomesphere.com/paper/1706.06953/full.md

## References

17 references — full list in the complete paper: https://tomesphere.com/paper/1706.06953/full.md

---
Source: https://tomesphere.com/paper/1706.06953