Graph-based Reinforcement Learning meets Mixed Integer Programs: An   application to 3D robot assembly discovery

Niklas Funk; Svenja Menzenbach; Georgia Chalvatzaki; Jan Peters

arXiv:2203.04120·cs.RO·August 3, 2022·1 cites

Graph-based Reinforcement Learning meets Mixed Integer Programs: An application to 3D robot assembly discovery

Niklas Funk, Svenja Menzenbach, Georgia Chalvatzaki, Jan Peters

PDF

Open Access

TL;DR

This paper presents a hierarchical method combining mixed-integer programming and graph-based reinforcement learning to efficiently assemble 3D structures with robots, demonstrating improved robustness and real-world transfer.

Contribution

It introduces a novel hierarchical framework integrating mixed-integer programming and graph-based RL for robot assembly tasks, enabling better generalization and structural feasibility considerations.

Findings

01

Outperforms unstructured end-to-end approaches in simulations

02

Demonstrates successful real-world transfer of the method

03

Shows robustness and efficiency in complex assembly tasks

Abstract

Robot assembly discovery is a challenging problem that lives at the intersection of resource allocation and motion planning. The goal is to combine a predefined set of objects to form something new while considering task execution with the robot-in-the-loop. In this work, we tackle the problem of building arbitrary, predefined target structures entirely from scratch using a set of Tetris-like building blocks and a robotic manipulator. Our novel hierarchical approach aims at efficiently decomposing the overall task into three feasible levels that benefit mutually from each other. On the high level, we run a classical mixed-integer program for global optimization of block-type selection and the blocks' final poses to recreate the desired shape. Its output is then exploited to efficiently guide the exploration of an underlying reinforcement learning (RL) policy. This RL policy draws its…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobot Manipulation and Learning · Manufacturing Process and Optimization · Reinforcement Learning in Robotics

MethodsQ-Learning