Towards Scalable and Interpretable Mobile App Risk Analysis via Large Language Models
Yu Yang, Zhenyuan Li, Xiandong Ran, Jiahao Liu, Jiahui Wang, Bo Yu, Shouling Ji

TL;DR
This paper introduces Mars, a system utilizing Large Language Models to automate mobile app risk analysis, significantly improving efficiency and accuracy over manual vetting processes.
Contribution
Mars is the first system to combine LLMs with a risk identification tree for scalable, interpretable, and automated mobile app risk assessment.
Findings
Achieved an F1-score of 0.838 in risk identification.
Attained an F1-score of 0.934 in evidence retrieval.
User study showed 60-90% efficiency gains.
Abstract
Mobile application marketplaces are responsible for vetting apps to identify and mitigate security risks. Current vetting processes are labor-intensive, relying on manual analysis by security professionals aided by semi-automated tools. To address this inefficiency, we propose Mars, a system that leverages Large Language Models (LLMs) for automated risk identification and profiling. Mars is designed to concurrently analyze multiple applications across diverse risk categories with minimal human intervention. To enhance analytical precision and operational efficiency, Mars leverages a pre-constructed risk identification tree to extract relevant indicators from high-dimensional application features. This initial step filters the data, reducing the input volume for the LLM and mitigating the potential for model hallucination induced by irrelevant features. The extracted indicators are then…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAnomaly Detection Techniques and Applications · Human-Automation Interaction and Safety · Ethics and Social Impacts of AI
