Characterizing the Action-Generalization Gap in Deep Q-Learning
Zhiyuan Zhou, Cameron Allen, Kavosh Asadi, George Konidaris

TL;DR
This paper investigates how deep Q-learning models generalize over actions in discrete spaces, proposing evaluation methods and analyzing the factors affecting their action generalization capabilities across different domains.
Contribution
It introduces a new evaluation approach for action generalization and characterizes the action-generalization gap in deep Q-learning across various domains.
Findings
DQN can generalize over actions in simple domains
Action generalization ability decreases as action space size increases
Action generalization accelerates learning when present
Abstract
We study the action generalization ability of deep Q-learning in discrete action spaces. Generalization is crucial for efficient reinforcement learning (RL) because it allows agents to use knowledge learned from past experiences on new tasks. But while function approximation provides deep RL agents with a natural way to generalize over state inputs, the same generalization mechanism does not apply to discrete action outputs. And yet, surprisingly, our experiments indicate that Deep Q-Networks (DQN), which use exactly this type of function approximator, are still able to achieve modest action generalization. Our main contribution is twofold: first, we propose a method of evaluating action generalization using expert knowledge of action similarity, and empirically confirm that action generalization leads to faster learning; second, we characterize the action-generalization gap (the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Advanced Memory and Neural Computing · Adversarial Robustness in Machine Learning
MethodsDense Connections · Convolution · Q-Learning · Deep Q-Network
