Multi-Agent Transfer Learning in Reinforcement Learning-Based Ride-Sharing Systems
Alberto Castagna, Ivana Dusparic

TL;DR
This paper explores how transfer learning parameters affect multi-agent reinforcement learning in ride-sharing systems, aiming to improve adaptation speed and efficiency by dynamically selecting transfer content based on agent confidence levels.
Contribution
It investigates the impact of fixed transfer parameters and confidence-based filtering in multi-agent RL for ride-sharing, advancing understanding of transfer learning in dynamic multi-agent environments.
Findings
Transfer parameters significantly influence learning efficiency.
Confidence-based filtering improves transfer relevance.
Results demonstrate potential for faster adaptation in ride-sharing systems.
Abstract
Reinforcement learning (RL) has been used in a range of simulated real-world tasks, e.g., sensor coordination, traffic light control, and on-demand mobility services. However, real world deployments are rare, as RL struggles with dynamic nature of real world environments, requiring time for learning a task and adapting to changes in the environment. Transfer Learning (TL) can help lower these adaptation times. In particular, there is a significant potential of applying TL in multi-agent RL systems, where multiple agents can share knowledge with each other, as well as with new agents that join the system. To obtain the most from inter-agent transfer, transfer roles (i.e., determining which agents act as sources and which as targets), as well as relevant transfer content parameters (e.g., transfer size) should be selected dynamically in each particular situation. As a first step towards…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTransportation and Mobility Innovations · Mobile Crowdsensing and Crowdsourcing · Reinforcement Learning in Robotics
