Evaluating Large Language Models in detecting Secrets in Android Apps
Marco Alecci, Jordan Samhi, Tegawend\'e F. Bissyand\'e, and Jacques Klein

TL;DR
This paper introduces SecretLoc, an LLM-based method for detecting hardcoded secrets in Android apps, surpassing traditional pattern-based approaches and revealing numerous previously undetected secrets, highlighting security risks.
Contribution
Proposes SecretLoc, an LLM-driven approach that detects hardcoded secrets in Android apps without relying on prior patterns or training data, improving over existing methods.
Findings
Detected 4828 previously missed secrets
Discovered over 10 new secret types including API keys and private keys
Found secrets in 42.5% of analyzed apps
Abstract
Mobile apps often embed authentication secrets, such as API keys, tokens, and client IDs, to integrate with cloud services. However, developers often hardcode these credentials into Android apps, exposing them to extraction through reverse engineering. Once compromised, adversaries can exploit secrets to access sensitive data, manipulate resources, or abuse APIs, resulting in significant security and financial risks. Existing detection approaches, such as regex-based analysis, static analysis, and machine learning, are effective for identifying known patterns but are fundamentally limited: they require prior knowledge of credential structures, API signatures, or training data. In this paper, we propose SecretLoc, an LLM-based approach for detecting hardcoded secrets in Android apps. SecretLoc goes beyond pattern matching; it leverages contextual and structural cues to identify secrets…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Malware Detection Techniques · Digital and Cyber Forensics · Security and Verification in Computing
