Hunting Vulnerability Variants in AI Infra: Measurement and Reference-Driven Detection

Tian Dong; Yanjun Chen; Shoufeng Zhang; Huaien Zhang; Yunlong Lyu; Keke Lian; Dong Zhang; Shaofeng Li; Hao Chen

arXiv:2605.20051·cs.CR·May 20, 2026

Hunting Vulnerability Variants in AI Infra: Measurement and Reference-Driven Detection

Tian Dong, Yanjun Chen, Shoufeng Zhang, Huaien Zhang, Yunlong Lyu, Keke Lian, Dong Zhang, Shaofeng Li, Hao Chen

PDF

TL;DR

This study analyzes the prevalence of vulnerability variants in AI infrastructure repositories and introduces INFRASCOPE, a framework for detecting such variants to improve security.

Contribution

The paper provides the first measurement of vulnerability variants in AI infra and proposes INFRASCOPE for automatic detection based on known disclosures.

Findings

01

AI infra projects often share vulnerable patterns and functionality.

02

INFRASCOPE successfully identified over 20 vulnerabilities in real-world repositories.

03

The framework uncovered 11 acknowledged vulnerabilities and 4 CVE-assigned cases.

Abstract

AI infra has become a shared execution layer for model training, deployment, and agent orchestration. Because many projects reimplement similar model-centric workflows, a vulnerability disclosed in one repository can recur as a variant in another repository with a related design. Yet the prevalence and detectability of these variants remain poorly understood. This paper presents a measurement study of vulnerability variants in AI infra. Analyzing 688 GitHub repositories and 251 publicly disclosed vulnerabilities, we find that AI infra projects frequently share overlapping functionality and recurrent vulnerable patterns, creating a concrete basis for cross-repository variants. Building on this finding, we study how to automatically identify such variants from known disclosures. We propose INFRASCOPE, a reference-driven multi-agent framework that extracts transferable vulnerability…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.