eDySec: A Deep Learning-based Explainable Dynamic Analysis Framework for Detecting Malicious Packages in PyPI Ecosystem
Sk Tanzir Mehedi, Raja Jurdak, Chadni Islam, Abu Bakar Siddique Mahi, and Gowri Ramachandran

TL;DR
eDySec is a deep learning framework that enhances detection, explainability, and stability in identifying malicious packages in the PyPI ecosystem, significantly reducing false positives and negatives.
Contribution
The paper introduces eDySec, a novel DL-based dynamic behavioral analysis framework that improves detection accuracy, stability, and explainability for malicious package identification.
Findings
eDySec halves feature dimensionality and reduces false positives by 82%.
It improves detection accuracy by 3% and achieves near-perfect stability.
Inference latency per package is maintained at 170ms.
Abstract
The security of open-source software repositories is increasingly threatened by next-gen software supply chain attacks. These attacks include multiphase malware execution, remote access activation, and dynamic payload generation. Traditional Machine Learning (ML) detectors struggle to detect these attacks due to the high-dimensional and sparse nature of dynamic behavioral data, including system calls, network traffic, directory access patterns, and dependency logs. As a result, these data characteristics degrade the performance, stability, and explainability of ML models. These challenges have made Deep Learning (DL) a promising alternative, given its success across various domains and its potential for modeling complex patterns. This paper presents eDySec, a DL-based efficient, stable, and explainable framework for dynamic behavioral analysis to detect malicious packages. Using the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
