Transferring Multiple Policies to Hotstart Reinforcement Learning in an Air Compressor Management Problem

Hélène Plisnier; Denis Steckelmacher; Jeroen Willems; Bruno Depraetere; Ann Nowé

arXiv:2301.12820·cs.AI·January 31, 2023

TL;DR

This paper introduces a method to transfer multiple pre-trained policies to accelerate reinforcement learning for new but similar industrial machine control tasks, demonstrated on an air compressor management problem.

Contribution

It applies Policy Intersection to transfer knowledge from several controllers, improving learning speed and performance in compressor control tasks.

Findings

1. Outperforms loading a single old controller.
2. Significantly improves long-term performance.
3. Reduces training time and resources.

Abstract

Many instances of similar or almost-identical industrial machines or tools are often deployed at once, or in quick succession. For instance, a particular model of air compressor may be installed at hundreds of customers. Because these tools perform distinct but highly similar tasks, it is interesting to be able to quickly produce a high-quality controller for machine $N + 1$ given the controllers already produced for machines $1, \dots, N$. This is even more important when the controllers are learned through Reinforcement Learning, as training takes time, energy and other resources. In this paper, we apply Policy Intersection, a Policy Shaping method, to help a Reinforcement Learning agent learn to solve a new variant of a compressor control problem faster, by transferring knowledge from several previously learned controllers. We show that our approach outperforms loading an old controller, and…
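
To make the idea of combining several previously learned controllers more concrete, here is a minimal sketch. It assumes that Policy Intersection mixes discrete action distributions by element-wise multiplication followed by renormalization, which is how Policy Shaping is commonly described; the paper's exact rule may differ. The function name `intersect_policies`, the fallback behaviour, and the example probabilities are all hypothetical and chosen only for illustration.

```python
from typing import List, Sequence


def intersect_policies(
    learner: Sequence[float],
    advisors: Sequence[Sequence[float]],
    eps: float = 1e-8,
) -> List[float]:
    """Combine action distributions by element-wise product, then renormalize.

    Assumption: this product rule stands in for Policy Intersection; it is an
    illustrative sketch, not the authors' implementation.
    """
    combined = list(learner)
    for advisor in advisors:
        if len(advisor) != len(combined):
            raise ValueError("all policies must cover the same action set")
        # An action keeps probability mass only if every policy supports it.
        combined = [p * q for p, q in zip(combined, advisor)]

    total = sum(combined)
    if total <= eps:
        # Hypothetical fallback: if the policies share no action,
        # return the learner's own distribution unchanged.
        return list(learner)
    return [p / total for p in combined]


# Hypothetical example: an untrained (uniform) learner for machine N + 1,
# advised by two controllers learned on earlier machines.
learner = [0.25, 0.25, 0.25, 0.25]
advisors = [
    [0.7, 0.1, 0.1, 0.1],
    [0.6, 0.2, 0.1, 0.1],
]
mixed = intersect_policies(learner, advisors)
```

Under this assumption, an untrained learner initially acts almost entirely as its advisors agree it should, and its own preferences gain influence as its distribution sharpens during training.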

Taxonomy

Topics: Reinforcement Learning in Robotics