GRASP: Gradient Realignment via Active Shared Perception for Multi-Agent Collaborative Optimization

Sihan Zhou; Tiantian He; Yifan Lu; Yaqing Hou; Yew-Soon Ong

arXiv:2604.00717·cs.MA·April 2, 2026

GRASP: Gradient Realignment via Active Shared Perception for Multi-Agent Collaborative Optimization

Sihan Zhou, Tiantian He, Yifan Lu, Yaqing Hou, Yew-Soon Ong

PDF

TL;DR

GRASP introduces a novel multi-agent optimization framework that actively aligns gradients to stabilize policy evolution, improving convergence speed and collaboration in complex environments.

Contribution

It proposes a new active perception mechanism using consensus gradients and proves equilibrium existence with Kakutani Fixed-Point Theorem.

Findings

01

Demonstrates scalability on SMAC and GRF benchmarks.

02

Achieves promising performance and faster convergence.

03

Provides theoretical guarantees for equilibrium stability.

Abstract

Non-stationarity arises from concurrent policy updates and leads to persistent environmental fluctuations. Existing approaches like Centralized Training with Decentralized Execution (CTDE) and sequential update schemes mitigate this issue. However, since the perception of the policies of other agents remains dependent on sampling environmental interaction data, the agent essentially operates in a passive perception state. This inevitably triggers equilibrium oscillations and significantly slows the convergence speed of the system. To address this issue, we propose Gradient Realignment via Active Shared Perception (GRASP), a novel framework that defines generalized Bellman equilibrium as a stable objective for policy evolution. The core mechanism of GRASP involves utilizing the independent gradients of agents to derive a defined consensus gradient, enabling agents to actively perceive…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.