A Geometric Nash Approach in Tuning the Learning Rate in Q-Learning Algorithm

Kwadwo Osei Bonsu

arXiv:2408.04911 · cs.LG · August 12, 2024

TL;DR

This paper introduces a geometric Nash approach to optimize the learning rate in Q-learning by analyzing the relationship between total time steps and reward vectors, improving learning stability and efficiency.

Contribution

It presents a novel geometric framework using Nash Equilibrium and angular bisectors to systematically estimate the learning rate in Q-learning.

Findings

01. Relationship between the learning rate and the angle between the T and R vectors

02. The angular bisector concept aids in estimating the optimal α

03. Enhanced stability and efficiency in Q-learning

Abstract

This paper proposes a geometric approach for estimating the $α$ value (the learning rate) in Q-learning. We establish a systematic framework that optimizes the $α$ parameter, thereby enhancing learning efficiency and stability. Our results show that there is a relationship between the learning rate and the angle between a vector T (the total time steps in each episode of learning) and a vector R (the reward for each episode). The concept of the angular bisector between vectors T and R, together with Nash Equilibrium, provides insight into estimating $α$ such that the algorithm minimizes losses arising from the exploration-exploitation trade-off.
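
For context, the sketch below shows where α enters the standard tabular Q-learning update, which is the parameter the abstract sets out to estimate. It is the textbook update rule, not the paper's estimation procedure; the function name `q_update`, the dictionary layout of the Q-table, and the numeric values are assumptions made for illustration.

```python
def q_update(q, state, action, reward, next_state, alpha, gamma):
    """Apply one tabular Q-learning update in place and return the new value.

    q maps each state to a list of action values. alpha is the learning
    rate and gamma the discount factor (both assumed to lie in [0, 1]).
    """
    # Bootstrapped target: immediate reward plus discounted best next value.
    td_target = reward + gamma * max(q[next_state])
    # alpha controls how far the old estimate moves toward the target.
    q[state][action] += alpha * (td_target - q[state][action])
    return q[state][action]


# Hypothetical two-state, two-action table.
q = {0: [0.0, 0.0], 1: [1.0, 2.0]}
new_value = q_update(q, state=0, action=1, reward=1.0,
                     next_state=1, alpha=0.5, gamma=0.9)
```

With `alpha=0` the estimate never changes, and with `alpha=1` it jumps straight to the target, which is why the choice of α affects both stability and speed of learning.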
