GO Hessian for Expectation-Based Objectives

Yulai Cong; Miaoyun Zhao; Jianqiao Li; Junya Chen; Lawrence Carin

arXiv:2006.08873·stat.ML·June 17, 2020

GO Hessian for Expectation-Based Objectives

Yulai Cong, Miaoyun Zhao, Jianqiao Li, Junya Chen, Lawrence Carin

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces GO Hessian, an unbiased low-variance estimator for second-order derivatives of expectation-based objectives, facilitating efficient curvature-based optimization in stochastic computation graphs.

Contribution

The paper extends the GO gradient to a Hessian estimator, enabling practical second-order optimization for expectation objectives involving complex stochastic nodes.

Findings

01

GO Hessian is easy to implement with auto-differentiation.

02

It provides efficient curvature information for non-reparameterizable distributions.

03

Experimental results show improved optimization performance.

Abstract

An unbiased low-variance gradient estimator, termed GO gradient, was proposed recently for expectation-based objectives $E_{q_{γ} (y)} [f (y)]$ , where the random variable (RV) $y$ may be drawn from a stochastic computation graph with continuous (non-reparameterizable) internal nodes and continuous/discrete leaves. Upgrading the GO gradient, we present for $E_{q_{γ} (y)} [f (y)]$ an unbiased low-variance Hessian estimator, named GO Hessian. Considering practical implementation, we reveal that GO Hessian is easy-to-use with auto-differentiation and Hessian-vector products, enabling efficient cheap exploitation of curvature information over stochastic computation graphs. As representative examples, we present the GO Hessian for non-reparameterizable gamma and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

YulaiCong/GOHessian
pytorchOfficial

Videos

GO Hessian for Expectation-Based Objectives· underline

Taxonomy

TopicsStochastic Gradient Optimization Techniques · Markov Chains and Monte Carlo Methods · Gaussian Processes and Bayesian Inference