A Novel Approach to Identify Security Controls in Source Code
Ahmet Okutan, Ali Shokri, Viktoria Koscinski, Mohamad Fazelinia, Mehdi, Mirakhorli

TL;DR
This paper presents a new method using NLP and machine learning to identify security control code snippets in software, aiding secure development and architecture review.
Contribution
It introduces a dataset of security control code snippets from StackOverflow and applies BERT and a tactic detector to accurately identify tactic-related code.
Findings
Achieved F-Measure above 0.9 in identifying security control code snippets
Demonstrated effectiveness of NLP techniques in security code analysis
Provided a comprehensive list of security controls and related code datasets
Abstract
Secure by Design has become the mainstream development approach ensuring that software systems are not vulnerable to cyberattacks. Architectural security controls need to be carefully monitored over the software development life cycle to avoid critical design flaws. Unfortunately, functional requirements usually get in the way of the security features, and the development team may not correctly address critical security requirements. Identifying tactic-related code pieces in a software project enables an efficient review of the security controls' implementation as well as a resilient software architecture. This paper enumerates a comprehensive list of commonly used security controls and creates a dataset for each one of them by pulling related and unrelated code snippets from the open API of the StackOverflow question and answer platform. It uses the state-of-the-art NLP technique…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware Engineering Research · Information and Cyber Security · Software Reliability and Analysis Research
