PID-Inspired Inductive Biases for Deep Reinforcement Learning in   Partially Observable Control Tasks

Ian Char; Jeff Schneider

arXiv:2307.05891·cs.LG·October 27, 2023·1 cites

PID-Inspired Inductive Biases for Deep Reinforcement Learning in Partially Observable Control Tasks

Ian Char, Jeff Schneider

PDF

Open Access 2 Repos 1 Video

TL;DR

This paper introduces PID-inspired history encoding architectures for deep reinforcement learning in partially observable control tasks, improving robustness and performance across tracking and locomotion benchmarks.

Contribution

It proposes novel PID-inspired history encoders for deep RL, enhancing robustness and performance in partially observable control tasks.

Findings

01

Encoders produce more robust policies than prior methods.

02

Achieve 1.7x better average performance on locomotion tasks.

03

Improve tracking task performance with PID-based features.

Abstract

Deep reinforcement learning (RL) has shown immense potential for learning to control systems through data alone. However, one challenge deep RL faces is that the full state of the system is often not observable. When this is the case, the policy needs to leverage the history of observations to infer the current state. At the same time, differences between the training and testing environments makes it critical for the policy not to overfit to the sequence of observations it sees at training time. As such, there is an important balancing act between having the history encoder be flexible enough to extract relevant information, yet be robust to changes in the environment. To strike this balance, we look to the PID controller for inspiration. We assert the PID controller's success shows that only summing and differencing are needed to accumulate information over time for many control…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

PID-Inspired Inductive Biases for Deep Reinforcement Learning in Partially Observable Control Tasks· slideslive

Taxonomy

TopicsReinforcement Learning in Robotics