Toward Trustworthy Agentic AI: A Multimodal Framework for Preventing Prompt Injection Attacks

Toqeer Ali Syed; Mishal Ateeq Almutairi; Mahmoud Abdel Moaty

arXiv:2512.23557·cs.CR·December 30, 2025

Toward Trustworthy Agentic AI: A Multimodal Framework for Preventing Prompt Injection Attacks

Toqeer Ali Syed, Mishal Ateeq Almutairi, Mahmoud Abdel Moaty

PDF

Open Access

TL;DR

This paper introduces a multimodal framework that enhances the security and trustworthiness of agentic AI systems by detecting and preventing prompt injection attacks through provenance-aware sanitization and validation across multi-agent networks.

Contribution

It proposes a novel Cross-Agent Multimodal Provenance-Aware Defense Framework that sanitizes prompts and verifies outputs to prevent malicious instructions from propagating in agentic AI environments.

Findings

01

Improved multimodal injection detection accuracy

02

Reduced cross-agent trust leakage

03

Enhanced stability of agentic execution pathways

Abstract

Powerful autonomous systems, which reason, plan, and converse using and between numerous tools and agents, are made possible by Large Language Models (LLMs), Vision-Language Models (VLMs), and new agentic AI systems, like LangChain and GraphChain. Nevertheless, this agentic environment increases the probability of the occurrence of multimodal prompt injection (PI) attacks, in which concealed or malicious instructions carried in text, pictures, metadata, or agent-to-agent messages may spread throughout the graph and lead to unintended behavior, a breach of policy, or corruption of state. In order to mitigate these risks, this paper suggests a Cross-Agent Multimodal Provenanc- Aware Defense Framework whereby all the prompts, either user-generated or produced by upstream agents, are sanitized and all the outputs generated by an LLM are verified independently before being sent to downstream…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsScientific Computing and Data Management · Big Data and Digital Economy · Multi-Agent Systems and Negotiation