Assessing the Effectiveness of LLMs in Android Application Vulnerability Analysis
Vasileios Kouliaridis, Georgios Karopoulos, Georgios Kambourakis

TL;DR
This study evaluates nine state-of-the-art large language models' effectiveness in detecting Android application vulnerabilities, comparing their performance on a dataset of vulnerable code samples and exploring retrieval-augmented techniques.
Contribution
It provides a comprehensive comparison of LLMs in Android vulnerability detection and introduces insights into context augmentation methods like RAG for improved security analysis.
Findings
Significant performance differences among LLMs in vulnerability detection
Retrieval-augmented generation enhances detection capabilities
Performance varies with obfuscated code samples
Abstract
The increasing frequency of attacks on Android applications coupled with the recent popularity of large language models (LLMs) necessitates a comprehensive understanding of the capabilities of the latter in identifying potential vulnerabilities, which is key to mitigate the overall risk. To this end, the work at hand compares the ability of nine state-of-the-art LLMs to detect Android code vulnerabilities listed in the latest Open Worldwide Application Security Project (OWASP) Mobile Top 10. Each LLM was evaluated against an open dataset of over 100 vulnerable code samples, including obfuscated ones, assessing each model's ability to identify key vulnerabilities. Our analysis reveals the strengths and weaknesses of each LLM, identifying important factors that contribute to their performance. Additionally, we offer insights into context augmentation with retrieval-augmented generation…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Malware Detection Techniques · Web Application Security Vulnerabilities · Mobile and Web Applications
