Generalization of Agent Behavior through Explicit Representation of Context
Cem C Tutum, Suhaib Abdulquddos, Risto Miikkulainen

TL;DR
This paper introduces a context-aware approach for autonomous agents that coevolves a context module with a skill module, enabling better generalization and robustness in unseen environments like video games and autonomous driving.
Contribution
It presents a novel coevolution framework where a context module recognizes temporal variations and modulates skills, improving agents' ability to extrapolate beyond training data.
Findings
Significantly improved robustness in unseen environments
Effective in video games and autonomous driving simulations
Enables agents to extrapolate beyond training variations
Abstract
In order to deploy autonomous agents in digital interactive environments, they must be able to act robustly in unseen situations. The standard machine learning approach is to include as much variation as possible into training these agents. The agents can then interpolate within their training, but they cannot extrapolate much beyond it. This paper proposes a principled approach where a context module is coevolved with a skill module in the game. The context module recognizes the temporal variation in the game and modulates the outputs of the skill module so that the action decisions can be made robustly even in previously unseen situations. The approach is evaluated in the Flappy Bird and LunarLander video games, as well as in the CARLA autonomous driving simulation. The Context+Skill approach leads to significantly more robust behavior in environments that require extrapolation beyond…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Artificial Intelligence in Games · Human Pose and Action Recognition
MethodsEntropy Regularization · Proximal Policy Optimization · CARLA: An Open Urban Driving Simulator
