The World Is Bigger! A Computationally-Embedded Perspective on the Big World Hypothesis
Alex Lewandowski, Adtiya A. Ramesh, Edan Meyer, Dale Schuurmans, Marlos C. Machado

TL;DR
This paper introduces a formal framework for understanding continual learning through the lens of an embedded automaton within an environment, emphasizing the importance of adaptability over fixed solutions.
Contribution
It formalizes the concept of an embedded agent as an automaton in a universal computer, linking it to POMDPs, and proposes an interactivity objective for continual learning evaluation.
Findings
Deep nonlinear networks struggle to sustain interactivity.
Deep linear networks maintain higher interactivity with increased capacity.
The proposed model-based RL algorithm effectively evaluates continual learning capabilities.
Abstract
Continual learning is often motivated by the idea, known as the big world hypothesis, that "the world is bigger" than the agent. Recent problem formulations capture this idea by explicitly constraining an agent relative to the environment. These constraints lead to solutions in which the agent continually adapts to best use its limited capacity, rather than converging to a fixed solution. However, explicit constraints can be ad hoc, difficult to incorporate, and may limit the effectiveness of scaling up the agent's capacity. In this paper, we characterize a problem setting in which an agent, regardless of its capacity, is constrained by being embedded in the environment. In particular, we introduce a computationally-embedded perspective that represents an embedded agent as an automaton simulated within a universal (formal) computer. Such an automaton is always constrained; we prove that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsReinforcement Learning in Robotics · Domain Adaptation and Few-Shot Learning · Game Theory and Applications
