TL;DR
Ego-Foresight introduces a self-supervised method that enhances reinforcement learning by disentangling agent information through motion prediction, leading to improved sample efficiency and performance.
Contribution
The paper proposes Ego-Foresight, a novel self-supervised approach that models agent awareness via motion prediction, improving RL sample efficiency without requiring supervision.
Findings
EF improves sample efficiency in RL tasks.
EF successfully predicts agent movement and disentangles agent information.
Integration of EF enhances RL performance in simulated control tasks.
Abstract
Despite the significant advances in Deep Reinforcement Learning (RL) observed in the last decade, the amount of training experience necessary to learn effective policies remains one of the primary concerns in both simulated and real environments. Looking to solve this issue, previous work has shown that improved efficiency can be achieved by separately modeling the agent and environment, but usually requires a supervisory signal. In contrast to RL, humans can perfect a new skill from a small number of trials and often do so without a supervisory signal, making neuroscientific studies of human development a valuable source of inspiration for RL. In particular, we explore the idea of motor prediction, which states that humans develop an internal model of themselves and of the consequences that their motor commands have on the immediate sensory inputs. Our insight is that the movementofthe…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
