A Digital Twin Framework for Reinforcement Learning with Real-Time Self-Improvement via Human Assistive Teleoperation
Kabirat Olayemi, Mien Van, Luke Maguire, Sean McLoone

TL;DR
This paper presents a digital twin framework for reinforcement learning that enables real-time self-improvement of models through human assistive teleoperation, enhancing training efficiency and adaptability in robotic systems.
Contribution
It introduces a human-in-the-loop DRL framework using digital twins, combining simulation-based pre-training with real-time human-guided retraining for improved performance.
Findings
Outperforms baseline models in reward accumulation
Demonstrates superior training efficiency
Effective in both simulation and real-world environments
Abstract
Reinforcement Learning (RL) or Deep Reinforcement Learning (DRL) is a powerful approach to solving Markov Decision Processes (MDPs) when the model of the environment is not known a priori. However, RL models are still faced with challenges such as handling covariate shifts and ensuring the quality of human demonstration. To address these challenges and further advance DRL models, our work develops a human-in-the-loop DRL framework via digital twin that leverages human intelligence after deployment to retrain the DRL model in real time. First, we develop a pre-trained model fully based on learning through trial and error in the simulated environment allowing scalability and automation while eliminating variability and biases that can come from subjective human guidance. Second, instead of deploying the trained model directly on the UGV, we create a digital twin which controls the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDigital Transformation in Industry
