Enabling First-Order Gradient-Based Learning for Equilibrium Computation   in Markets

Nils Kohring; Fabian R. Pieroth; Martin Bichler

arXiv:2303.09500·cs.GT·March 17, 2023·1 cites

Enabling First-Order Gradient-Based Learning for Equilibrium Computation in Markets

Nils Kohring, Fabian R. Pieroth, Martin Bichler

PDF

Open Access 1 Repo

TL;DR

This paper introduces a smoothing technique for differentiable market simulations, enabling efficient first-order gradient-based equilibrium computation, which outperforms zeroth-order methods in accuracy and speed.

Contribution

The paper proposes a novel smoothing method that makes non-differentiable market simulations amenable to first-order gradient methods, with theoretical bias bounds and empirical validation.

Findings

01

The smoothing technique reduces variance in gradient estimates.

02

First-order methods outperform zeroth-order in accuracy.

03

The approach improves computational efficiency in equilibrium computation.

Abstract

Understanding and analyzing markets is crucial, yet analytical equilibrium solutions remain largely infeasible. Recent breakthroughs in equilibrium computation rely on zeroth-order policy gradient estimation. These approaches commonly suffer from high variance and are computationally expensive. The use of fully differentiable simulators would enable more efficient gradient estimation. However, the discrete allocation of goods in economic simulations is a non-differentiable operation. This renders the first-order Monte Carlo gradient estimator inapplicable and the learning feedback systematically misleading. We propose a novel smoothing technique that creates a surrogate market game, in which first-order methods can be applied. We provide theoretical bounds on the resulting bias which justifies solving the smoothed game instead. These bounds also allow choosing the smoothing strength a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

heidekrueger/bnelearn
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Stochastic Gradient Optimization Techniques · Model Reduction and Neural Networks