Deep Dive into the Abuse of DL APIs To Create Malicious AI Models and How to Detect Them
Mohamed Nabeel, Oleksii Starov

TL;DR
This paper explores how malicious actors can abuse deep learning APIs like TensorFlow to inject hidden functionalities for attacks, highlighting detection challenges and proposing methods using LLMs for improved security.
Contribution
It uncovers the potential for abusing TensorFlow APIs for malicious purposes and develops detection techniques leveraging large language models to identify such abuses.
Findings
Existing scanners fail to detect API abuse due to lack of semantic analysis
Demonstrated attacks exploiting hidden API functionalities in TensorFlow
Proposed LLM-based scanners improve detection of malicious API usage
Abstract
According to Gartner, more than 70% of organizations will have integrated AI models into their workflows by the end of 2025. In order to reduce cost and foster innovation, it is often the case that pre-trained models are fetched from model hubs like Hugging Face or TensorFlow Hub. However, this introduces a security risk where attackers can inject malicious code into the models they upload to these hubs, leading to various kinds of attacks including remote code execution (RCE), sensitive data exfiltration, and system file modification when these models are loaded or executed (predict function). Since AI models play a critical role in digital transformation, this would drastically increase the number of software supply chain attacks. While there are several efforts at detecting malware when deserializing pickle based saved models (hiding malware in model parameters), the risk of abusing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Malware Detection Techniques · Adversarial Robustness in Machine Learning · Security and Verification in Computing
