Bilevel Optimization for Covert Memory Tampering in Heterogeneous Multi-Agent Architectures (XAMT)
Akhil Sharma, Shaikh Yaser Arafat, Jai Kumar Sharma, Ken Huang

TL;DR
This paper introduces XAMT, a bilevel optimization framework that generates covert memory tampering attacks on heterogeneous multi-agent systems, highlighting vulnerabilities in shared memory components like experience buffers and knowledge bases.
Contribution
It formalizes a novel bilevel optimization approach for stealthy memory tampering attacks in heterogeneous MAS, unifying attack strategies across MARL and RAG-based agents.
Findings
Effective at sub-percent poison rates in benchmarks
Creates stealthy, minimal-perturbation attacks
Evades detection heuristics successfully
Abstract
The increasing operational reliance on complex Multi-Agent Systems (MAS) across safety-critical domains necessitates rigorous adversarial robustness assessment. Modern MAS are inherently heterogeneous, integrating conventional Multi-Agent Reinforcement Learning (MARL) with emerging Large Language Model (LLM) agent architectures utilizing Retrieval-Augmented Generation (RAG). A critical shared vulnerability is reliance on centralized memory components: the shared Experience Replay (ER) buffer in MARL and the external Knowledge Base (K) in RAG agents. This paper proposes XAMT (Bilevel Optimization for Covert Memory Tampering in Heterogeneous Multi-Agent Architectures), a novel framework that formalizes attack generation as a bilevel optimization problem. The Upper Level minimizes perturbation magnitude (delta) to enforce covertness while maximizing system behavior divergence toward an…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Security and Verification in Computing · Network Security and Intrusion Detection
