Machine Learning Models Have a Supply Chain Problem

Sarah Meiklejohn; Hayden Blauzvern; Mihai Maruseac; Spencer Schrock; Laurent Simon; Ilia Shumailov

arXiv:2505.22778·cs.LG·May 30, 2025

Machine Learning Models Have a Supply Chain Problem

Sarah Meiklejohn, Hayden Blauzvern, Mihai Maruseac, Spencer Schrock, Laurent Simon, Ilia Shumailov

PDF

Open Access

TL;DR

This paper highlights the supply chain risks associated with open machine learning models, such as malicious replacements and data poisoning, and proposes using Sigstore to enhance transparency and security in model distribution.

Contribution

It identifies key supply chain vulnerabilities in open ML ecosystems and explores applying Sigstore for model signing and dataset verification to mitigate these risks.

Findings

01

Open ML models face significant supply chain threats.

02

Sigstore can be adapted to improve transparency in ML model sharing.

03

Model signing and dataset verification can reduce malicious attacks.

Abstract

Powerful machine learning (ML) models are now readily available online, which creates exciting possibilities for users who lack the deep technical expertise or substantial computing resources needed to develop them. On the other hand, this type of open ecosystem comes with many risks. In this paper, we argue that the current ecosystem for open ML models contains significant supply-chain risks, some of which have been exploited already in real attacks. These include an attacker replacing a model with something malicious (e.g., malware), or a model being trained using a vulnerable version of a framework or on restricted or poisoned data. We then explore how Sigstore, a solution designed to bring transparency to open-source software supply chains, can be used to bring transparency to open ML models, in terms of enabling model publishers to sign their models and prove properties about the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Advanced Malware Detection Techniques · Ethics and Social Impacts of AI