Toward Debugging Deep Reinforcement Learning Programs with RLExplorer
Rached Bouchoucha, Ahmed Haj Yahmed, Darshan Patil, Janarthanan, Rajendran, Amin Nikanjam, Sarath Chandar, Foutse Khomh

TL;DR
RLExplorer is a novel tool that automates fault diagnosis in deep reinforcement learning systems by monitoring training traces and identifying DRL-specific faults, significantly aiding developers in debugging complex DRL applications.
Contribution
This paper introduces RLExplorer, the first automated fault diagnosis approach specifically designed for deep reinforcement learning software systems.
Findings
Diagnosed real faults in 83% of faulty DRL samples from Stack Overflow.
Identified 3.6 times more defects than manual debugging.
Easily integrates into existing DRL applications.
Abstract
Deep reinforcement learning (DRL) has shown success in diverse domains such as robotics, computer games, and recommendation systems. However, like any other software system, DRL-based software systems are susceptible to faults that pose unique challenges for debugging and diagnosing. These faults often result in unexpected behavior without explicit failures and error messages, making debugging difficult and time-consuming. Therefore, automating the monitoring and diagnosis of DRL systems is crucial to alleviate the burden on developers. In this paper, we propose RLExplorer, the first fault diagnosis approach for DRL-based software systems. RLExplorer automatically monitors training traces and runs diagnosis routines based on properties of the DRL learning dynamics to detect the occurrence of DRL-specific faults. It then logs the results of these diagnoses as warnings that cover…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Evolutionary Algorithms and Applications
