Improving Requirements Completeness: Automated Assistance through Large Language Models
Dipeeka Luitel, Shabnam Hassani, Mehrdad Sabetzadeh

TL;DR
This paper investigates using BERT, a large language model, to detect incompleteness in natural language requirements by predicting missing content, demonstrating its effectiveness and proposing methods to optimize and filter predictions.
Contribution
It introduces a novel approach utilizing BERT's MLM to identify missing information in requirements and develops techniques to optimize prediction accuracy and reduce noise.
Findings
BERT effectively highlights missing terminology in requirements.
BERT outperforms simpler baseline models in detecting omissions.
A machine learning filter reduces noise and improves prediction quality.
Abstract
Natural language (NL) is arguably the most prevalent medium for expressing systems and software requirements. Detecting incompleteness in NL requirements is a major challenge. One approach to identify incompleteness is to compare requirements with external sources. Given the rise of large language models (LLMs), an interesting question arises: Are LLMs useful external sources of knowledge for detecting potential incompleteness in NL requirements? This article explores this question by utilizing BERT. Specifically, we employ BERT's masked language model (MLM) to generate contextualized predictions for filling masked slots in requirements. To simulate incompleteness, we withhold content from the requirements and assess BERT's ability to predict terminology that is present in the withheld content but absent in the disclosed content. BERT can produce multiple predictions per mask. Our first…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware Engineering Research · Software Engineering Techniques and Practices · Software Reliability and Analysis Research
