Geometrical Regret Matching

Sizhong Lan

arXiv:1908.09021·cs.GT·January 24, 2020

Geometrical Regret Matching

Sizhong Lan

PDF

Open Access

TL;DR

This paper introduces a geometrical regret matching method that ensures smooth strategy updates, leading to a more natural convergence towards Nash equilibrium compared to traditional jumpy methods.

Contribution

It proposes a new geometrical approach to regret matching that provides smooth strategy updates, improving the convergence process towards Nash equilibrium.

Findings

01

Smooth strategy updates facilitate convergence to Nash equilibrium.

02

The geometrical method offers a clear, visualizable path towards equilibrium.

03

Limitations exist in optimizing approximation accuracy.

Abstract

We argue that the existing regret matchings for Nash equilibrium approximation conduct "jumpy" strategy updating when the probabilities of future plays are set to be proportional to positive regret measures. We propose a geometrical regret matching which features "smooth" strategy updating. Our approach is simple, intuitive and natural. The analytical and numerical results show that, continuously and "smoothly" suppressing "unprofitable" pure strategies is sufficient for the game to evolve towards Nash equilibrium, suggesting that in reality the tendency for equilibrium could be pervasive and irresistible. Technically, iterative regret matching gives rise to a sequence of adjusted mixed strategies for our study its approximation to the true equilibrium point. The sequence can be studied in metric space and visualized nicely as a clear path towards an equilibrium point. Our theory has…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGame Theory and Applications · Artificial Intelligence in Games · Advanced Bandit Algorithms Research