InspectJS: Leveraging Code Similarity and User-Feedback for Effective Taint Specification Inference for JavaScript
Saikat Dutta, Diego Garbervetsky, Shuvendu Lahiri, Max Sch\"afer

TL;DR
This paper combines machine learning and manual techniques within GitHub's CodeQL framework to improve the inference of taint specifications for JavaScript, enhancing vulnerability detection accuracy.
Contribution
It introduces a hybrid approach that leverages code similarity and user feedback to automate and refine taint sink inference for JavaScript libraries.
Findings
Machine learning infers new taint sinks not captured manually.
Code similarity metrics help organize and prioritize sink predictions.
Hybrid approach improves taint specification accuracy.
Abstract
Static analysis has established itself as a weapon of choice for detecting security vulnerabilities. Taint analysis in particular is a very general and powerful technique, where security policies are expressed in terms of forbidden flows, either from untrusted input sources to sensitive sinks (in integrity policies) or from sensitive sources to untrusted sinks (in confidentiality policies). The appeal of this approach is that the taint-tracking mechanism has to be implemented only once, and can then be parameterized with different taint specifications (that is, sets of sources and sinks, as well as any sanitizers that render otherwise problematic flows innocuous) to detect many different kinds of vulnerabilities. But while techniques for implementing scalable inter-procedural static taint tracking are fairly well established, crafting taint specifications is still more of an art than…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware Engineering Research · Web Application Security Vulnerabilities · Advanced Malware Detection Techniques
