Bridging Expert Reasoning and LLM Detection: A Knowledge-Driven Framework for Malicious Packages

Wenbo Guo; Shiwen Song; Jiaxun Guo; Zhengzi Xu; Chengwei Liu; Haoran Ou; Mengmeng Ge; Yang Liu

arXiv:2601.16458·cs.SE·January 26, 2026

Bridging Expert Reasoning and LLM Detection: A Knowledge-Driven Framework for Malicious Packages

Wenbo Guo, Shiwen Song, Jiaxun Guo, Zhengzi Xu, Chengwei Liu, Haoran Ou, Mengmeng Ge, Yang Liu

PDF

Open Access

TL;DR

IntelGuard is a knowledge-driven framework that enhances malicious package detection by integrating expert reasoning with large language models, achieving high accuracy and uncovering new threats in open-source ecosystems.

Contribution

It introduces a retrieval-augmented generation framework that combines expert threat intelligence with LLMs for more effective malicious package detection.

Findings

01

Achieves 99% detection accuracy on real-world packages

02

Discovered 54 previously unreported malicious packages on PyPI

03

Maintains high accuracy even on obfuscated code

Abstract

Open-source ecosystems such as NPM and PyPI are increasingly targeted by supply chain attacks, yet existing detection methods either depend on fragile handcrafted rules or data-driven features that fail to capture evolving attack semantics. We present IntelGuard, a retrieval-augmented generation (RAG) based framework that integrates expert analytical reasoning into automated malicious package detection. IntelGuard constructs a structured knowledge base from over 8,000 threat intelligence reports, linking malicious code snippets with behavioral descriptions and expert reasoning. When analyzing new packages, it retrieves semantically similar malicious examples and applies LLM-guided reasoning to assess whether code behaviors align with intended functionality. Experiments on 4,027 real-world packages show that IntelGuard achieves 99% accuracy and a 0.50% false positive rate, while…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Malware Detection Techniques · Spam and Phishing Detection · Information and Cyber Security