Efficient Reinforcement Learning in Resource Allocation Problems Through   Permutation Invariant Multi-task Learning

Desmond Cai; Shiau Hong Lim; Laura Wynter

arXiv:2102.09361·cs.LG·February 19, 2021

Efficient Reinforcement Learning in Resource Allocation Problems Through Permutation Invariant Multi-task Learning

Desmond Cai, Shiau Hong Lim, Laura Wynter

PDF

Open Access

TL;DR

This paper introduces a neural network architecture and sampling strategy for multi-task reinforcement learning that leverages task invariance properties to significantly improve sample efficiency in resource allocation problems.

Contribution

The paper presents a novel multi-task learning approach exploiting invariance properties, with a theoretical performance bound and empirical validation on real-world tasks.

Findings

01

Enhanced sample efficiency in resource allocation RL tasks

02

Effective neural network architecture for invariant multi-task learning

03

Improved performance in financial portfolio optimization and federated learning

Abstract

One of the main challenges in real-world reinforcement learning is to learn successfully from limited training samples. We show that in certain settings, the available data can be dramatically increased through a form of multi-task learning, by exploiting an invariance property in the tasks. We provide a theoretical performance bound for the gain in sample efficiency under this setting. This motivates a new approach to multi-task learning, which involves the design of an appropriate neural network architecture and a prioritized task-sampling strategy. We demonstrate empirically the effectiveness of the proposed approach on two real-world sequential resource allocation tasks where this invariance property occurs: financial portfolio optimization and meta federated learning.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Advanced Bandit Algorithms Research · Smart Grid Energy Management