A Rusty Link in the AI Supply Chain: Detecting Evil Configurations in   Model Repositories

Ziqi Ding; Qian Fu; Junchen Ding; Gelei Deng; Yi Liu and; Yuekang Li

arXiv:2505.01067·cs.CR·May 5, 2025

A Rusty Link in the AI Supply Chain: Detecting Evil Configurations in Model Repositories

Ziqi Ding, Qian Fu, Junchen Ding, Gelei Deng, Yi Liu and, Yuekang Li

PDF

Open Access

TL;DR

This paper investigates security vulnerabilities in AI model repositories, especially malicious configurations on Hugging Face, and introduces CONFIGSCAN, an LLM-based tool for detecting suspicious configurations with high accuracy.

Contribution

It provides the first comprehensive analysis of malicious configurations in AI repositories and proposes CONFIGSCAN, a novel LLM-based detection method for security threats.

Findings

01

Thousands of suspicious repositories identified

02

CONFIGSCAN achieves low false positives and high detection accuracy

03

Highlights urgent need for security validation in AI hosting platforms

Abstract

Recent advancements in large language models (LLMs) have spurred the development of diverse AI applications from code generation and video editing to text generation; however, AI supply chains such as Hugging Face, which host pretrained models and their associated configuration files contributed by the public, face significant security challenges; in particular, configuration files originally intended to set up models by specifying parameters and initial settings can be exploited to execute unauthorized code, yet research has largely overlooked their security compared to that of the models themselves; in this work, we present the first comprehensive study of malicious configurations on Hugging Face, identifying three attack scenarios (file, website, and repository operations) that expose inherent risks; to address these threats, we introduce CONFIGSCAN, an LLM-based tool that analyzes…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Malware Detection Techniques · Software Engineering Research · Ethics and Social Impacts of AI

MethodsSparse Evolutionary Training