# KF-LAX: Kronecker-factored curvature estimation for control variate   optimization in reinforcement learning

**Authors:** Mohammad Firouzi

arXiv: 1812.04181 · 2018-12-12

## TL;DR

This paper introduces KF-LAX, a method that applies Kronecker-factored curvature estimation to improve sample efficiency and reduce variance in gradient-based reinforcement learning optimization.

## Contribution

It combines KFAC with RELAX for control variate optimization, enhancing sample efficiency in reinforcement learning tasks.

## Key findings

- Improved sample efficiency in synthetic and Atari tasks
- Reduced variance in gradient estimates
- Enhanced performance over baseline methods

## Abstract

A key challenge for gradient based optimization methods in model-free reinforcement learning is to develop an approach that is sample efficient and has low variance. In this work, we apply Kronecker-factored curvature estimation technique (KFAC) to a recently proposed gradient estimator for control variate optimization, RELAX, to increase the sample efficiency of using this gradient estimation method in reinforcement learning. The performance of the proposed method is demonstrated on a synthetic problem and a set of three discrete control task Atari games.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1812.04181/full.md

## Figures

8 figures with captions in the complete paper: https://tomesphere.com/paper/1812.04181/full.md

## References

24 references — full list in the complete paper: https://tomesphere.com/paper/1812.04181/full.md

---
Source: https://tomesphere.com/paper/1812.04181