Partial Attention in Deep Reinforcement Learning for Safe Multi-Agent Control

Turki Bin Mohaya; Peter Seiler

arXiv:2603.21810·eess.SY·March 24, 2026

Partial Attention in Deep Reinforcement Learning for Safe Multi-Agent Control

Turki Bin Mohaya, Peter Seiler

PDF

Open Access

TL;DR

This paper introduces a partial attention mechanism within a QMIX framework for multi-agent deep reinforcement learning to improve safety and efficiency in autonomous vehicle highway merging scenarios.

Contribution

It presents a novel neural network design incorporating partial attention for multi-agent control in Dec-POMDP environments, enhancing safety and performance.

Findings

01

Improved safety metrics in simulations

02

Higher driving speeds achieved

03

Enhanced reward optimization

Abstract

Attention mechanisms excel at learning sequential patterns by discriminating data based on relevance and importance. This provides state-of-the-art performance in advanced generative artificial intelligence models. This paper applies this concept of an attention mechanism for multi-agent safe control. We specifically consider the design of a neural network to control autonomous vehicles in a highway merging scenario. The environment is modeled as a Decentralized Partially Observable Markov Decision Process (Dec-POMDP). Within a QMIX framework, we include partial attention for each autonomous vehicle, thus allowing each ego vehicle to focus on the most relevant neighboring vehicles. Moreover, we propose a comprehensive reward signal that considers the global objectives of the environment (e.g., safety and vehicle flow) and the individual interests of each agent. Simulations are conducted…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAutonomous Vehicle Technology and Safety · Reinforcement Learning in Robotics · Traffic control and management