Planning from Pixels in Environments with Combinatorially Hard Search   Spaces

Marco Bagatella; Mirek Ol\v{s}\'ak; Michal Rol\'inek; Georg Martius

arXiv:2110.06149·cs.LG·March 21, 2022

Planning from Pixels in Environments with Combinatorially Hard Search Spaces

Marco Bagatella, Mirek Ol\v{s}\'ak, Michal Rol\'inek, Georg Martius

PDF

Open Access 2 Repos 1 Video

TL;DR

This paper introduces a novel approach that transforms raw visual input into a latent graph to enable efficient planning in environments with complex combinatorial structures, demonstrating strong generalization and robustness.

Contribution

It presents a method for environment representation as a latent graph with state reidentification, reducing planning complexity and introducing challenging environments for testing.

Findings

01

Achieves linear complexity in planning from visual input

02

Demonstrates strong generalization across environment variations

03

Performs well in one-shot planning and offline RL settings

Abstract

The ability to form complex plans based on raw visual input is a litmus test for current capabilities of artificial intelligence, as it requires a seamless combination of visual processing and abstract algorithmic execution, two traditionally separate areas of computer science. A recent surge of interest in this field brought advances that yield good performance in tasks ranging from arcade games to continuous control; these methods however do not come without significant issues, such as limited generalization capabilities and difficulties when dealing with combinatorially hard planning instances. Our contribution is two-fold: (i) we present a method that learns to represent its environment as a latent graph and leverages state reidentification to reduce the complexity of finding a good policy from exponential to linear (ii) we introduce a set of lightweight environments with an…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

Planning from Pixels in Environments with Combinatorially Hard Search Spaces· slideslive

Taxonomy

TopicsAI-based Problem Solving and Planning

MethodsTest