Securing the AI Supply Chain: What Can We Learn From Developer-Reported Security Issues and Solutions of AI Projects?
The Anh Nguyen, Triet Huynh Minh Le, M. Ali Babar

TL;DR
This paper provides an empirical analysis of developer-reported security issues in AI projects, revealing common vulnerabilities and solutions across the AI supply chain to inform better security practices.
Contribution
It introduces a novel pipeline combining keyword matching and a fine-tuned distilBERT classifier to analyze a large dataset of security discussions in AI projects.
Findings
Identified 32 security issues and 24 solutions across four themes.
Many security issues stem from dependencies and the black-box nature of AI components.
Challenges related to Models and Data often lack concrete solutions.
Abstract
The rapid growth of Artificial Intelligence (AI) models and applications has led to an increasingly complex security landscape. Developers of AI projects must contend not only with traditional software supply chain issues but also with novel, AI-specific security threats. However, little is known about what security issues are commonly encountered and how they are resolved in practice. This gap hinders the development of effective security measures for each component of the AI supply chain. We bridge this gap by conducting an empirical investigation of developer-reported issues and solutions, based on discussions from Hugging Face and GitHub. To identify security-related discussions, we develop a pipeline that combines keyword matching with an optimal fine-tuned distilBERT classifier, which achieved the best performance in our extensive comparison of various deep learning and large…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsInformation and Cyber Security · Advanced Malware Detection Techniques · Software Engineering Research
