Cross-project Classification of Security-related Requirements
Mazen Mohamad, Jan-Philipp Stegh\"ofer, Riccardo Scandariato

TL;DR
This paper explores the potential of using machine learning classifiers trained on online requirement specifications to identify security-related requirements across different domains and styles, aiding compliance and security assurance processes.
Contribution
It demonstrates the feasibility of cross-project classification of security requirements using heterogeneous data and highlights the importance of data consistency for improved accuracy.
Findings
Feasibility of training classifiers on diverse requirement data
Performance improves with data consistency revisions
Type-specific training yields better results
Abstract
We investigate the feasibility of using a classifier for security-related requirements trained on requirement specifications available online. This is helpful in case different requirement types are not differentiated in a large existing requirement specification. Our work is motivated by the need to identify security requirements for the creation of security assurance cases that become a necessity for many organizations with new and upcoming standards like GDPR and HiPAA. We base our investigation on ten requirement specifications, randomly selected from a Google Search and partially pre-labeled. To validate the model, we run 10-fold cross-validation on the data where each specification constitutes a group. Our results indicate the feasibility of training a model from a heterogeneous data set including specifications from multiple domains and in different styles. However, performance…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsInformation and Cyber Security · Safety Systems Engineering in Autonomy · Software Reliability and Analysis Research
