Transfer Learning versus Multi-agent Learning regarding Distributed Decision-Making in Highway Traffic
Mark Schutera, Niklas Goby, Dirk Neumann, Markus Reischl

TL;DR
This paper compares transfer learning and multi-agent learning in mixed-intelligence highway traffic scenarios using deep reinforcement learning within the MIT DeepTraffic simulation, analyzing their impact on traffic flow and decision-making.
Contribution
It introduces a comparative analysis of transfer learning versus multi-agent learning for distributed decision-making in mixed traffic using deep reinforcement learning.
Findings
Transfer learning can effectively initialize multi-agent systems.
Multi-agent learning adapts better to high ratios of AI-driven traffic.
Traffic flow varies significantly with different learning strategies.
Abstract
Transportation and traffic are currently undergoing a rapid increase in terms of both scale and complexity. At the same time, an increasing share of traffic participants are being transformed into agents driven or supported by artificial intelligence resulting in mixed-intelligence traffic. This work explores the implications of distributed decision-making in mixed-intelligence traffic. The investigations are carried out on the basis of an online-simulated highway scenario, namely the MIT \emph{DeepTraffic} simulation. In the first step traffic agents are trained by means of a deep reinforcement learning approach, being deployed inside an elitist evolutionary algorithm for hyperparameter search. The resulting architectures and training parameters are then utilized in order to either train a single autonomous traffic agent and transfer the learned weights onto a multi-agent scenario or…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Data Stream Mining Techniques · Fault Detection and Control Systems
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
