CHASE: LLM Agents for Dissecting Malicious PyPI Packages

Takaaki Toda; Tatsuya Mori

arXiv:2601.06838·cs.CR·January 13, 2026

CHASE: LLM Agents for Dissecting Malicious PyPI Packages

Takaaki Toda, Tatsuya Mori

PDF

Open Access

TL;DR

CHASE introduces a multi-agent system utilizing LLMs and deterministic tools to reliably detect malicious PyPI packages, achieving high accuracy and efficiency suitable for real-world deployment.

Contribution

The paper presents a novel multi-agent architecture that overcomes LLM limitations for malware detection in software packages, combining semantic analysis with deterministic security tools.

Findings

01

Achieves 98.4% recall with 0.08% false positives

02

Median analysis time of 4.5 minutes per package

03

Effective in operational package screening environments

Abstract

Modern software package registries like PyPI have become critical infrastructure for software development, but are increasingly exploited by threat actors distributing malicious packages with sophisticated multi-stage attack chains. While Large Language Models (LLMs) offer promising capabilities for automated code analysis, their application to security-critical malware detection faces fundamental challenges, including hallucination and context confusion, which can lead to missed detections or false alarms. We present CHASE (Collaborative Hierarchical Agents for Security Exploration), a high-reliability multi-agent architecture that addresses these limitations through a Plan-and-Execute coordination model, specialized Worker Agents focused on specific analysis aspects, and integration with deterministic security tools for critical operations. Our key insight is that reliability in…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Malware Detection Techniques · Software Engineering Research · Adversarial Robustness in Machine Learning