Reinforcement Learning for Assignment problem

Filipp Skomorokhov (1, 2), George Ovchinnikov (2) ((1) Moscow Institute of Physics and Technology, (2) Skolkovo Institute of Science and Technology)

arXiv:2011.03909·cs.AI·November 10, 2020·1 cites

TL;DR

This paper explores using reinforcement learning with neural networks to solve user scheduling problems, demonstrating improved performance over traditional greedy methods in a stochastic simulation environment.

Contribution

It introduces a Q-learning based approach tailored for dynamic user scheduling, outperforming analytical greedy solutions in simulated scenarios.

Findings

01. Q-learning outperforms greedy algorithms in total reward

02. The method adapts well to stochastic environment changes

03. Reinforcement learning reduces scheduling penalties

Abstract

This paper applies reinforcement learning combined with neural networks to a general formulation of the user scheduling problem. Our simulator resembles real-world problems by introducing stochastic changes in the environment. We applied a Q-learning-based method to a number of dynamic simulations and outperformed an analytical greedy solution in terms of total reward, where the objective is to incur the lowest possible penalty over the course of a simulation.
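To make the setup in the abstract concrete, here is a minimal sketch of a Q-learning agent and a greedy baseline on a toy scheduling problem. It is not the authors' simulator or method: the paper combines Q-learning with neural networks, whereas this sketch uses a plain lookup table, and the environment (`step`), the penalty definition, and all names and parameters (`greedy_action`, `q_update`, `train`, `arrival_prob`, `max_wait`) are assumptions chosen for illustration.

```python
import random
from collections import defaultdict


def greedy_action(waits):
    """Greedy baseline (assumed form): serve the user who has waited longest."""
    return max(range(len(waits)), key=lambda i: waits[i])


def step(waits, action, rng, arrival_prob=0.5, max_wait=3):
    """Hypothetical toy environment, not the paper's simulator.

    The served user's wait resets to 0; every other user's wait grows by 1
    with probability `arrival_prob`, capped at `max_wait`. The reward is the
    negative total wait, so a higher reward means a lower penalty.
    """
    nxt = []
    for i, w in enumerate(waits):
        if i == action:
            nxt.append(0)
        elif rng.random() < arrival_prob:
            nxt.append(min(w + 1, max_wait))
        else:
            nxt.append(w)
    nxt = tuple(nxt)
    return nxt, -sum(nxt)


def q_update(Q, s, a, r, s2, n_actions, alpha=0.1, gamma=0.9):
    """One tabular Q-learning update:
    Q(s, a) += alpha * (r + gamma * max_b Q(s2, b) - Q(s, a))
    """
    best_next = max(Q[(s2, b)] for b in range(n_actions))
    Q[(s, a)] += alpha * (r + gamma * best_next - Q[(s, a)])


def train(n_users=3, episodes=200, horizon=20, eps=0.1, seed=0):
    """Train a Q-table with epsilon-greedy exploration on the toy environment."""
    rng = random.Random(seed)
    Q = defaultdict(float)
    for _ in range(episodes):
        s = (0,) * n_users
        for _ in range(horizon):
            if rng.random() < eps:
                a = rng.randrange(n_users)  # explore
            else:
                a = max(range(n_users), key=lambda b: Q[(s, b)])  # exploit
            s2, r = step(s, a, rng)
            q_update(Q, s, a, r, s2, n_actions=n_users)
            s = s2
    return Q
```

Calling `train()` returns a Q-table whose learned policy can be compared with `greedy_action` by rolling both out in `step` and summing the rewards. Whether the learned policy beats the greedy one in this toy setting depends on the assumed dynamics and says nothing about the paper's reported results.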

Taxonomy

Topics: Smart Parking Systems Research · Smart Grid Energy Management · Transportation and Mobility Innovations

Methods: Q-Learning