Challenging Machine Learning Algorithms in Predicting Vulnerable JavaScript Functions
Rudolf Ferenc, P\'eter Heged\H{u}s, P\'eter Gyimesi, G\'abor Antal,, D\'enes B\'an, Tibor Gyim\'othy

TL;DR
This study evaluates various machine learning algorithms, including deep learning, for predicting security vulnerabilities in JavaScript functions, using a new dataset and static code metrics, with KNN showing the best performance.
Contribution
It introduces a new dataset for vulnerability prediction in JavaScript and compares multiple ML algorithms, highlighting the effectiveness of KNN and the impact of re-sampling strategies.
Findings
KNN achieved an F-measure of 0.76 in vulnerability prediction.
Deep learning and SVM classifiers performed competitively with F-measures over 0.70.
Re-sampling strategies affected the balance between precision and recall.
Abstract
The rapid rise of cyber-crime activities and the growing number of devices threatened by them place software security issues in the spotlight. As around 90% of all attacks exploit known types of security issues, finding vulnerable components and applying existing mitigation techniques is a viable practical approach for fighting against cyber-crime. In this paper, we investigate how the state-of-the-art machine learning techniques, including a popular deep learning algorithm, perform in predicting functions with possible security vulnerabilities in JavaScript programs. We applied 8 machine learning algorithms to build prediction models using a new dataset constructed for this research from the vulnerability information in public databases of the Node Security Project and the Snyk platform, and code fixing patches from GitHub. We used static source code metrics as predictors and an…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Malware Detection Techniques
