Metacontrol for Adaptive Imagination-Based Optimization

Jessica B. Hamrick; Andrew J. Ballard; Razvan Pascanu; Oriol Vinyals,; Nicolas Heess; Peter W. Battaglia

arXiv:1705.02670·cs.LG·November 9, 2022·47 cites

Metacontrol for Adaptive Imagination-Based Optimization

Jessica B. Hamrick, Andrew J. Ballard, Razvan Pascanu, Oriol Vinyals,, Nicolas Heess, Peter W. Battaglia

PDF

Open Access 1 Repo

TL;DR

This paper introduces a metacontroller that adaptively allocates computational resources by imagining internal simulations, improving efficiency in solving complex tasks with varying difficulty levels.

Contribution

It proposes a novel reinforcement learning framework that learns to decide the number of simulation steps and which models to consult, optimizing computational efficiency.

Findings

01

Metacontroller adapts computation based on task difficulty.

02

It learns to select experts considering reliability and cost.

03

Achieves lower total cost compared to fixed policies.

Abstract

Many machine learning systems are built to solve the hardest examples of a particular task, which often makes them large and expensive to run---especially with respect to the easier examples, which might require much less computation. For an agent with a limited computational budget, this "one-size-fits-all" approach may result in the agent wasting valuable computation on easy examples, while not spending enough on hard examples. Rather than learning a single, fixed policy for solving all instances of a task, we introduce a metacontroller which learns to optimize a sequence of "imagined" internal simulations over predictive models of the world in order to construct a more informed, and more economical, solution. The metacontroller component is a model-free reinforcement learning agent, which decides both how many iterations of the optimization procedure to run, as well as which model to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

deepmind/spaceship_dataset
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Reinforcement Learning in Robotics · Machine Learning and Data Classification