Nightmare Dreamer: Dreaming About Unsafe States And Planning Ahead

Oluwatosin Oseni; Shengjie Wang; Jun Zhu; and Micah Corah

arXiv:2601.04686·cs.LG·January 9, 2026

Nightmare Dreamer: Dreaming About Unsafe States And Planning Ahead

Oluwatosin Oseni, Shengjie Wang, Jun Zhu, and Micah Corah

PDF

Open Access

TL;DR

Nightmare Dreamer is a model-based Safe Reinforcement Learning algorithm that predicts safety violations using a learned world model, significantly reducing safety violations while maintaining high reward performance in robotics tasks.

Contribution

It introduces Nightmare Dreamer, a novel model-based Safe RL method that effectively predicts unsafe states and plans safely, outperforming existing model-free approaches.

Findings

01

Nearly zero safety violations achieved

02

20x efficiency improvement over baselines

03

Effective with only image observations

Abstract

Reinforcement Learning (RL) has shown remarkable success in real-world applications, particularly in robotics control. However, RL adoption remains limited due to insufficient safety guarantees. We introduce Nightmare Dreamer, a model-based Safe RL algorithm that addresses safety concerns by leveraging a learned world model to predict potential safety violations and plan actions accordingly. Nightmare Dreamer achieves nearly zero safety violations while maximizing rewards. Nightmare Dreamer outperforms model-free baselines on Safety Gymnasium tasks using only image observations, achieving nearly a 20x improvement in efficiency.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Adversarial Robustness in Machine Learning · Autonomous Vehicle Technology and Safety