Code Vulnerability Detection Across Different Programming Languages with AI Models
Hael Abdulhakim Ali Humran, Ferdi Sonmez

TL;DR
This paper explores the use of transformer-based AI models like CodeBERT and CodeLlama for detecting code vulnerabilities across multiple programming languages, demonstrating high accuracy and potential for improved security analysis.
Contribution
It introduces a methodology for fine-tuning AI models on diverse vulnerability datasets and incorporates ensemble learning and explainability to enhance detection performance.
Findings
CodeBERT achieves over 97% accuracy in vulnerability detection.
AI models generalize well across different programming languages.
Hybrid approaches reduce false positives and improve reliability.
Abstract
Security vulnerabilities present in a code that has been written in diverse programming languages are among the most critical yet complicated aspects of source code to detect. Static analysis tools based on rule-based patterns usually do not work well at detecting the context-dependent bugs and lead to high false positive rates. Recent developments in artificial intelligence, specifically the use of transformer-based models like CodeBERT and CodeLlama, provide light to this problem, as they show potential in finding such flaws better. This paper presents the implementations of these models on various datasets of code vulnerability, showing how off-the-shelf models can successfully produce predictive capacity in models through dynamic fine-tuning of the models on vulnerable and safe code fragments. The methodology comprises the gathering of the dataset, normalization of the language,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware Reliability and Analysis Research · Software Engineering Research · Advanced Malware Detection Techniques
