EnStack: An Ensemble Stacking Framework of Large Language Models for Enhanced Vulnerability Detection in Source Code
Shahriyar Zaman Ridoy, Md. Shazzad Hossain Shaon, Alfredo Cuzzocrea,, Mst Shapna Akter

TL;DR
EnStack is an ensemble framework combining multiple large language models and meta-classifiers to improve automated vulnerability detection in source code, outperforming existing methods in accuracy and robustness.
Contribution
This paper introduces EnStack, a novel ensemble stacking framework that leverages multiple pre-trained LLMs and meta-classifiers for enhanced vulnerability detection in source code.
Findings
EnStack outperforms existing vulnerability detection methods.
The ensemble achieves higher accuracy, precision, recall, and F1-score.
Meta-classifiers effectively combine diverse LLM outputs for better detection.
Abstract
Automated detection of software vulnerabilities is critical for enhancing security, yet existing methods often struggle with the complexity and diversity of modern codebases. In this paper, we introduce EnStack, a novel ensemble stacking framework that enhances vulnerability detection using natural language processing (NLP) techniques. Our approach synergizes multiple pre-trained large language models (LLMs) specialized in code understanding CodeBERT for semantic analysis, GraphCodeBERT for structural representation, and UniXcoder for cross-modal capabilities. By fine-tuning these models on the Draper VDISC dataset and integrating their outputs through meta-classifiers such as Logistic Regression, Support Vector Machines (SVM), Random Forest, and XGBoost, EnStack effectively captures intricate code patterns and vulnerabilities that individual models may overlook. The meta-classifiers…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware Reliability and Analysis Research · Web Application Security Vulnerabilities · Software Engineering Research
MethodsLogistic Regression · CodeBERT
