An Empirical Study on LLM-based Classification of Requirements-related Provisions in Food-safety Regulations
Shabnam Hassani, Mehrdad Sabetzadeh, Daniel Amyot

TL;DR
This study evaluates the effectiveness of large language models like GPT-4o and BERT in classifying requirements-related provisions in food-safety regulations, demonstrating high accuracy and generalizability across jurisdictions.
Contribution
It introduces a grounded theory-based conceptual framework for food-safety regulations and empirically compares LLMs' performance in legal classification tasks.
Findings
GPT-4o achieves 89% precision and 87% recall in classification.
Few-shot learning with GPT-4o boosts recall to 97%, but reduces precision to 65%.
LLMs outperform LSTM and keyword-based baselines in accuracy.
Abstract
As Industry 4.0 transforms the food industry, the role of software in achieving compliance with food-safety regulations is becoming increasingly critical. Food-safety regulations, like those in many legal domains, have largely been articulated in a technology-independent manner to ensure their longevity and broad applicability. However, this approach leaves a gap between the regulations and the modern systems and software increasingly used to implement them. In this article, we pursue two main goals. First, we conduct a Grounded Theory study of food-safety regulations and develop a conceptual characterization of food-safety concepts that closely relate to systems and software requirements. Second, we examine the effectiveness of two families of large language models (LLMs) -- BERT and GPT -- in automatically classifying legal provisions based on requirements-related food-safety…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFood Supply Chain Traceability · Food Safety and Hygiene · Topic Modeling
