AegisUI: Behavioral Anomaly Detection for Structured User Interface Protocols in AI Agent Systems
Mohd Safwan Uddin, Saba Hajira

TL;DR
AegisUI introduces a framework for detecting behavioral anomalies in dynamically generated user interfaces of AI agents, using feature extraction and machine learning to identify malicious payloads across multiple attack types.
Contribution
The paper presents a novel benchmarking framework and dataset for behavioral anomaly detection in UI payloads, comparing multiple machine learning detectors in this context.
Findings
Random Forest achieved highest accuracy (0.931) and precision (0.980).
Autoencoder detects anomalies without malicious labels during training.
Layout abuse is easiest to detect, manipulative UI is hardest.
Abstract
AI agents that build user interfaces on the fly assembling buttons, forms, and data displays from structured protocol payloads are becoming common in production systems. The trouble is that a payload can pass every schema check and still trick a user: a button might say "View invoice" while its hidden action wipes an account, or a display widget might quietly bind to an internal salary field. Current defenses stop at syntax; they were never built to catch this kind of behavioral mismatch. We built AegisUI to study exactly this gap. The framework generates structured UI payloads, injects realistic attacks into them, extracts numeric features, and benchmarks anomaly detectors end-to-end. We produced 4000 labeled payloads (3000 benign, 1000 malicious) spanning five application domains and five attack families: phishing interfaces, data leakage, layout abuse, manipulative UI, and workflow…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Malware Detection Techniques · Adversarial Robustness in Machine Learning · Security and Verification in Computing
