Coarse-Grained Smoothness for RL in Metric Spaces

Omer Gottesman; Kavosh Asadi; Cameron Allen; Sam Lobel; George; Konidaris; Michael Littman

arXiv:2110.12276·cs.LG·October 26, 2021

Coarse-Grained Smoothness for RL in Metric Spaces

Omer Gottesman, Kavosh Asadi, Cameron Allen, Sam Lobel, George, Konidaris, Michael Littman

PDF

Open Access

TL;DR

This paper introduces a new coarse-grained smoothness concept for Q-functions in reinforcement learning, addressing limitations of Lipschitz continuity and enabling tighter bounds for better learning in continuous spaces.

Contribution

It proposes a generalized smoothness definition that is more applicable than Lipschitz continuity, with theoretical analysis and implications for control and exploration.

Findings

01

New smoothness definition generalizes Lipschitz continuity

02

Tighter bounds on Q-functions improve learning efficiency

03

Enhanced control and exploration in continuous domains

Abstract

Principled decision-making in continuous state--action spaces is impossible without some assumptions. A common approach is to assume Lipschitz continuity of the Q-function. We show that, unfortunately, this property fails to hold in many typical domains. We propose a new coarse-grained smoothness definition that generalizes the notion of Lipschitz continuity, is more widely applicable, and allows us to compute significantly tighter bounds on Q-functions, leading to improved learning. We provide a theoretical analysis of our new smoothness definition, and discuss its implications and impact on control and exploration in continuous domains.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Algorithms · Reinforcement Learning in Robotics · Advanced Bandit Algorithms Research