Towards a Rigorous Statistical Analysis of Empirical Password Datasets
Jeremiah Blocki, Peiyuan Liu

TL;DR
This paper develops statistically rigorous methods to analyze the success probability of an optimal attacker in cracking passwords within a given number of guesses, based on password datasets and assumptions about attacker knowledge.
Contribution
It introduces new techniques to bound attacker success probabilities using limited data, and applies these to evaluate password distributions, throttling, and policies.
Findings
Empirical bounds closely match observed success rates for small guess counts.
Zipf's Law overestimates attacker success for large guess numbers.
Password policies can be quantitatively evaluated using the proposed bounds.
Abstract
A central challenge in password security is to characterize the attacker's guessing curve i.e., what is the probability that the attacker will crack a random user's password within the first guesses. A key challenge is that the guessing curve depends on the attacker's guessing strategy and the distribution of user passwords both of which are unknown to us. In this work we aim to follow Kerckhoffs' principle and analyze the performance of an optimal attacker who knows the password distribution. Let denote the probability that such an attacker can crack a random user's password within guesses. We develop several statistically rigorous techniques to upper and lower bound given independent samples from the unknown distribution. We show that our bounds hold with high confidence and apply our techniques to analyze eight password datasets. Our empirical…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsUser Authentication and Security Systems · Advanced Malware Detection Techniques · Biometric Identification and Security
