Enhancing Password Security Through a High-Accuracy Scoring Framework Using Random Forests
Muhammed El Mustaqeem Mazelan, Noor Hazlina Abdul, Nouar AlDahoul

TL;DR
This paper presents a high-accuracy password strength scoring system using Random Forests with novel feature engineering, significantly improving security assessment and user feedback over traditional methods.
Contribution
It introduces a hybrid feature engineering approach and demonstrates that Random Forests outperform other models in password strength prediction, enhancing practical cybersecurity tools.
Findings
RF model achieved 99.12% accuracy on test data
Feature importance analysis provides actionable security insights
Hybrid features effectively capture password vulnerabilities
Abstract
Password security plays a crucial role in cybersecurity, yet traditional password strength meters, which rely on static rules like character-type requirements, often fail. Such methods are easily bypassed by common password patterns (e.g., 'P@ssw0rd1!'), giving users a false sense of security. To address this, we implement and evaluate a password strength scoring system by comparing four machine learning models: Random Forest (RF), Support Vector Machine (SVM), a Convolutional Neural Network (CNN), and Logistic Regression with a dataset of over 660,000 real-world passwords. Our primary contribution is a novel hybrid feature engineering approach that captures nuanced vulnerabilities missed by standard metrics. We introduce features like leetspeak-normalized Shannon entropy to assess true randomness, pattern detection for keyboard walks and sequences, and character-level TF-IDF n-grams to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsUser Authentication and Security Systems · Advanced Malware Detection Techniques · Biometric Identification and Security
