Leveraging Flawed Tutorials for Seeding Large-Scale Web Vulnerability Discovery
Tommi Unruh, Bhargava Shastry, Malte Skoruppa, Federico Maggi, Konrad, Rieck, Jean-Pierre Seifert, Fabian Yamaguchi

TL;DR
This paper demonstrates that vulnerable code snippets in popular tutorials can be leveraged to identify similar security flaws in thousands of open-source web applications, revealing widespread security risks.
Contribution
It introduces a semi-automated method to use flawed tutorials as seeds for large-scale vulnerability discovery in web applications.
Findings
Found 117 vulnerabilities similar to tutorial snippets in 64,415 PHP projects
Showed tutorials often contain security flaws that are reused in real-world code
Validated the feasibility of large-scale vulnerability detection from poor-quality tutorials
Abstract
The Web is replete with tutorial-style content on how to accomplish programming tasks. Unfortunately, even top-ranked tutorials suffer from severe security vulnerabilities, such as cross-site scripting (XSS), and SQL injection (SQLi). Assuming that these tutorials influence real-world software development, we hypothesize that code snippets from popular tutorials can be used to bootstrap vulnerability discovery at scale. To validate our hypothesis, we propose a semi-automated approach to find recurring vulnerabilities starting from a handful of top-ranked tutorials that contain vulnerable code snippets. We evaluate our approach by performing an analysis of tens of thousands of open-source web applications to check if vulnerabilities originating in the selected tutorials recur. Our analysis framework has been running on a standard PC, analyzed 64,415 PHP codebases hosted on GitHub thus…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsWeb Application Security Vulnerabilities · Software Engineering Research · Software Testing and Debugging Techniques
