Metacontrol for Adaptive Imagination-Based Optimization
Jessica B. Hamrick, Andrew J. Ballard, Razvan Pascanu, Oriol Vinyals,, Nicolas Heess, Peter W. Battaglia

TL;DR
This paper introduces a metacontroller that adaptively allocates computational resources by imagining internal simulations, improving efficiency in solving complex tasks with varying difficulty levels.
Contribution
It proposes a novel reinforcement learning framework that learns to decide the number of simulation steps and which models to consult, optimizing computational efficiency.
Findings
Metacontroller adapts computation based on task difficulty.
It learns to select experts considering reliability and cost.
Achieves lower total cost compared to fixed policies.
Abstract
Many machine learning systems are built to solve the hardest examples of a particular task, which often makes them large and expensive to run---especially with respect to the easier examples, which might require much less computation. For an agent with a limited computational budget, this "one-size-fits-all" approach may result in the agent wasting valuable computation on easy examples, while not spending enough on hard examples. Rather than learning a single, fixed policy for solving all instances of a task, we introduce a metacontroller which learns to optimize a sequence of "imagined" internal simulations over predictive models of the world in order to construct a more informed, and more economical, solution. The metacontroller component is a model-free reinforcement learning agent, which decides both how many iterations of the optimization procedure to run, as well as which model to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Reinforcement Learning in Robotics · Machine Learning and Data Classification
