Pretty darn good control: when are approximate solutions better than approximate models
Felipe Montealegre-Mora, Marcus Lapeyrolerie, Melissa Chapman, Abigail, G. Keller, Carl Boettiger

TL;DR
This paper explores when approximate solutions to simplified models outperform those to more accurate models, demonstrating DRL's ability to find effective, interpretable control policies in complex, real-world systems like fisheries.
Contribution
It shows that deep reinforcement learning can derive effective control policies directly from complex models without explicit process inference, outperforming traditional constant policies.
Findings
DRL discovers interpretable control rules in a fishery model
DRL policies outperform constant mortality policies in profitability
Deep RL can handle complex, nonlinear control problems without explicit model inference
Abstract
Existing methods for optimal control struggle to deal with the complexity commonly encountered in real-world systems, including dimensionality, process error, model bias and data heterogeneity. Instead of tackling these system complexities directly, researchers have typically sought to simplify models to fit optimal control methods. But when is the optimal solution to an approximate, stylized model better than an approximate solution to a more accurate model? While this question has largely gone unanswered owing to the difficulty of finding even approximate solutions for complex models, recent algorithmic and computational advances in deep reinforcement learning (DRL) might finally allow us to address these questions. DRL methods have to date been applied primarily in the context of games or robotic mechanics, which operate under precisely known rules. Here, we demonstrate the ability…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Energy Load and Power Forecasting · Energy, Environment, and Transportation Policies
