Learning to Perform Physics Experiments via Deep Reinforcement Learning

Misha Denil; Pulkit Agrawal; Tejas D Kulkarni; Tom Erez; Peter; Battaglia; Nando de Freitas

arXiv:1611.01843·stat.ML·August 21, 2017·55 cites

Learning to Perform Physics Experiments via Deep Reinforcement Learning

Misha Denil, Pulkit Agrawal, Tejas D Kulkarni, Tom Erez, Peter, Battaglia, Nando de Freitas

PDF

Open Access

TL;DR

This paper demonstrates that deep reinforcement learning agents can actively perform experiments in simulated environments to infer hidden physical properties of objects, mimicking human scientific exploration.

Contribution

It introduces a set of interactive tasks for agents to learn physical properties and shows that deep RL can develop strategies balancing information gathering and error costs.

Findings

01

Agents successfully infer properties like mass and cohesion.

02

Strategies vary with difficulty and cost constraints.

03

Deep RL can emulate scientific experimentation processes.

Abstract

When encountering novel objects, humans are able to infer a wide range of physical properties such as mass, friction and deformability by interacting with them in a goal driven way. This process of active interaction is in the same spirit as a scientist performing experiments to discover hidden facts. Recent advances in artificial intelligence have yielded machines that can achieve superhuman performance in Go, Atari, natural language processing, and complex control problems; however, it is not clear that these systems can rival the scientific intuition of even a young child. In this work we introduce a basic set of tasks that require agents to estimate properties such as mass and cohesion of objects in an interactive simulated environment where they can manipulate the objects and observe the consequences. We found that state of art deep reinforcement learning methods can learn to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Anomaly Detection Techniques and Applications · Robot Manipulation and Learning