Learning-Based Relaxation of Completeness Requirements for Data Entry Forms
Hichem Belgacem, Xiaochen Li, Domenico Bianculli, Lionel C. Briand

TL;DR
LACQUER is a learning-based system that automatically relaxes obsolete completeness requirements in data entry forms, reducing meaningless entries and improving data quality through Bayesian Networks and oversampling techniques.
Contribution
It introduces a novel automated approach using Bayesian Networks and SMOTE to identify and relax obsolete required fields in data entry forms, outperforming manual rule-based methods.
Findings
Achieves precision between 0.76 and 0.90 in relaxing requirements.
Prevents 20% to 64% of meaningless user inputs.
Predicts completeness requirements within 839 ms.
Abstract
Data entry forms use completeness requirements to specify the fields that are required or optional to fill for collecting necessary information from different types of users. However, some required fields may not be applicable for certain types of users anymore. Nevertheless, they may still be incorrectly marked as required in the form; we call such fields obsolete required fields. Since obsolete required fields usually have not-null validation checks before submitting the form, users have to enter meaningless values in such fields in order to complete the form submission. These meaningless values threaten the quality of the filled data. To avoid users filling meaningless values, existing techniques usually rely on manually written rules to identify the obsolete required fields and relax their completeness requirements. However, these techniques are ineffective and costly. In this…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Quality and Management · Data Mining Algorithms and Applications · Traffic Prediction and Management Techniques
