EnStack: An Ensemble Stacking Framework of Large Language Models for   Enhanced Vulnerability Detection in Source Code

Shahriyar Zaman Ridoy; Md. Shazzad Hossain Shaon; Alfredo Cuzzocrea,; Mst Shapna Akter

arXiv:2411.16561·cs.SE·November 26, 2024

EnStack: An Ensemble Stacking Framework of Large Language Models for Enhanced Vulnerability Detection in Source Code

Shahriyar Zaman Ridoy, Md. Shazzad Hossain Shaon, Alfredo Cuzzocrea,, Mst Shapna Akter

PDF

Open Access

TL;DR

EnStack is an ensemble framework combining multiple large language models and meta-classifiers to improve automated vulnerability detection in source code, outperforming existing methods in accuracy and robustness.

Contribution

This paper introduces EnStack, a novel ensemble stacking framework that leverages multiple pre-trained LLMs and meta-classifiers for enhanced vulnerability detection in source code.

Findings

01

EnStack outperforms existing vulnerability detection methods.

02

The ensemble achieves higher accuracy, precision, recall, and F1-score.

03

Meta-classifiers effectively combine diverse LLM outputs for better detection.

Abstract

Automated detection of software vulnerabilities is critical for enhancing security, yet existing methods often struggle with the complexity and diversity of modern codebases. In this paper, we introduce EnStack, a novel ensemble stacking framework that enhances vulnerability detection using natural language processing (NLP) techniques. Our approach synergizes multiple pre-trained large language models (LLMs) specialized in code understanding CodeBERT for semantic analysis, GraphCodeBERT for structural representation, and UniXcoder for cross-modal capabilities. By fine-tuning these models on the Draper VDISC dataset and integrating their outputs through meta-classifiers such as Logistic Regression, Support Vector Machines (SVM), Random Forest, and XGBoost, EnStack effectively captures intricate code patterns and vulnerabilities that individual models may overlook. The meta-classifiers…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSoftware Reliability and Analysis Research · Web Application Security Vulnerabilities · Software Engineering Research

MethodsLogistic Regression · CodeBERT