Theoretically Principled Deep RL Acceleration via Nearest Neighbor   Function Approximation

Junhong Shen; Lin F. Yang

arXiv:2110.04422·cs.LG·October 12, 2021

Theoretically Principled Deep RL Acceleration via Nearest Neighbor Function Approximation

Junhong Shen, Lin F. Yang

PDF

Open Access 1 Video

TL;DR

This paper introduces a nearest neighbor function approximator for deep reinforcement learning that improves sample efficiency and stability, supported by theoretical guarantees and practical experiments.

Contribution

It proposes a theoretically grounded NN approximator for deep RL, including a new online policy algorithm and a plug-and-play module for existing methods.

Findings

01

Higher sample efficiency in control and locomotion tasks

02

Enhanced stability of deep RL agents

03

Theoretical regret bounds depending on environment complexity

Abstract

Recently, deep reinforcement learning (RL) has achieved remarkable empirical success by integrating deep neural networks into RL frameworks. However, these algorithms often require a large number of training samples and admit little theoretical understanding. To mitigate these issues, we propose a theoretically principled nearest neighbor (NN) function approximator that can improve the value networks in deep RL methods. Inspired by human similarity judgments, the NN approximator estimates the action values using rollouts on past observations and can provably obtain a small regret bound that depends only on the intrinsic complexity of the environment. We present (1) Nearest Neighbor Actor-Critic (NNAC), an online policy gradient algorithm that demonstrates the practicality of combining function approximation with deep RL, and (2) a plug-and-play NN update module that aids the training of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Theoretically Principled Deep Rl Acceleration via Nearest Neighbor Function Approximation· underline

Taxonomy

TopicsReinforcement Learning in Robotics · Explainable Artificial Intelligence (XAI)