Learning to Perform Physics Experiments via Deep Reinforcement Learning
Misha Denil, Pulkit Agrawal, Tejas D Kulkarni, Tom Erez, Peter, Battaglia, Nando de Freitas

TL;DR
This paper demonstrates that deep reinforcement learning agents can actively perform experiments in simulated environments to infer hidden physical properties of objects, mimicking human scientific exploration.
Contribution
It introduces a set of interactive tasks for agents to learn physical properties and shows that deep RL can develop strategies balancing information gathering and error costs.
Findings
Agents successfully infer properties like mass and cohesion.
Strategies vary with difficulty and cost constraints.
Deep RL can emulate scientific experimentation processes.
Abstract
When encountering novel objects, humans are able to infer a wide range of physical properties such as mass, friction and deformability by interacting with them in a goal driven way. This process of active interaction is in the same spirit as a scientist performing experiments to discover hidden facts. Recent advances in artificial intelligence have yielded machines that can achieve superhuman performance in Go, Atari, natural language processing, and complex control problems; however, it is not clear that these systems can rival the scientific intuition of even a young child. In this work we introduce a basic set of tasks that require agents to estimate properties such as mass and cohesion of objects in an interactive simulated environment where they can manipulate the objects and observe the consequences. We found that state of art deep reinforcement learning methods can learn to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Anomaly Detection Techniques and Applications · Robot Manipulation and Learning
