Deep Dive into the Abuse of DL APIs To Create Malicious AI Models and How to Detect Them

Mohamed Nabeel; Oleksii Starov

arXiv:2601.04553·cs.CR·January 9, 2026

Deep Dive into the Abuse of DL APIs To Create Malicious AI Models and How to Detect Them

Mohamed Nabeel, Oleksii Starov

PDF

Open Access

TL;DR

This paper explores how malicious actors can abuse deep learning APIs like TensorFlow to inject hidden functionalities for attacks, highlighting detection challenges and proposing methods using LLMs for improved security.

Contribution

It uncovers the potential for abusing TensorFlow APIs for malicious purposes and develops detection techniques leveraging large language models to identify such abuses.

Findings

01

Existing scanners fail to detect API abuse due to lack of semantic analysis

02

Demonstrated attacks exploiting hidden API functionalities in TensorFlow

03

Proposed LLM-based scanners improve detection of malicious API usage

Abstract

According to Gartner, more than 70% of organizations will have integrated AI models into their workflows by the end of 2025. In order to reduce cost and foster innovation, it is often the case that pre-trained models are fetched from model hubs like Hugging Face or TensorFlow Hub. However, this introduces a security risk where attackers can inject malicious code into the models they upload to these hubs, leading to various kinds of attacks including remote code execution (RCE), sensitive data exfiltration, and system file modification when these models are loaded or executed (predict function). Since AI models play a critical role in digital transformation, this would drastically increase the number of software supply chain attacks. While there are several efforts at detecting malware when deserializing pickle based saved models (hiding malware in model parameters), the risk of abusing…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Malware Detection Techniques · Adversarial Robustness in Machine Learning · Security and Verification in Computing