Surrogate Empowered Sim2Real Transfer of Deep Reinforcement Learning for ORC Superheat Control
Runze Lin, Yangyang Luo, Xialai Wu, Junghui Chen, Biao Huang, Lei Xie,, Hongye Su

TL;DR
This paper introduces a Sim2Real transfer learning approach for deep reinforcement learning in ORC superheat control, enhancing training efficiency and generalization across varying operating conditions in energy systems.
Contribution
It presents a novel Sim2Real transfer learning method that improves DRL training speed and robustness for ORC control, addressing safety and generalization challenges.
Findings
Significantly faster DRL training in ORC control tasks.
Enhanced generalization of the DRL agent across multiple operating modes.
Effective mitigation of safety risks associated with physical system testing.
Abstract
The Organic Rankine Cycle (ORC) is widely used in industrial waste heat recovery due to its simple structure and easy maintenance. However, in the context of smart manufacturing in the process industry, traditional model-based optimization control methods are unable to adapt to the varying operating conditions of the ORC system or sudden changes in operating modes. Deep reinforcement learning (DRL) has significant advantages in situations with uncertainty as it directly achieves control objectives by interacting with the environment without requiring an explicit model of the controlled plant. Nevertheless, direct application of DRL to physical ORC systems presents unacceptable safety risks, and its generalization performance under model-plant mismatch is insufficient to support ORC control requirements. Therefore, this paper proposes a Sim2Real transfer learning-based DRL control method…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBuilding Energy and Comfort Optimization · Fuel Cells and Related Materials · Advanced Control Systems Optimization
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
