A Legal Framework for Natural Language Processing Model Training in Portugal
R\'uben Almeida, Evelin Amorim

TL;DR
This paper proposes a legal framework tailored for Portuguese NLP model training, aiming to bridge the gap between legal and technical communities to ensure compliance amidst rapid AI advancements.
Contribution
It introduces a multidisciplinary approach to align NLP development with Portuguese legislation, addressing legal concerns and promoting responsible AI research.
Findings
Identification of key legal issues in Portuguese NLP development
Guidelines for legal compliance in NLP model training
Enhanced understanding between legal and technical teams
Abstract
Recent advances in deep learning have promoted the advent of many computational systems capable of performing intelligent actions that, until then, were restricted to the human intellect. In the particular case of human languages, these advances allowed the introduction of applications like ChatGPT that are capable of generating coherent text without being explicitly programmed to do so. Instead, these models use large volumes of textual data to learn meaningful representations of human languages. Associated with these advances, concerns about copyright and data privacy infringements caused by these applications have emerged. Despite these concerns, the pace at which new natural language processing applications continued to be developed largely outperformed the introduction of new regulations. Today, communication barriers between legal experts and computer scientists motivate many…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEuropean Criminal Justice and Data Protection
