A Multilingual Human Annotated Corpus of Original and Easy-to-Read Texts to Support Access to Democratic Participatory Processes
Stefan Bott, Verena Riegler, Horacio Saggion, Almudena Rasc\'on Alcaina, Nouran Khallaf

TL;DR
This paper introduces a multilingual, human-annotated corpus of original and Easy-to-Read texts in English, Spanish, Catalan, and Italian, aimed at improving access to democratic participation for diverse language speakers.
Contribution
It provides the first annotated Easy-to-Read corpus for Catalan and high-quality resources for Spanish and Italian, supporting research in text simplification for democratic access.
Findings
First annotated E2R corpus for Catalan.
High-quality human simplifications for Spanish and Italian.
Resources will be publicly accessible.
Abstract
Being able to understand information is a key factor for a self-determined life and society. It is also very important for participating in democratic processes. The study of automatic text simplification is often limited by the availability of high quality material for the training and evaluation on automatic simplifiers. This is true for English, but more so for less resourced languages like Spanish, Catalan and Italian. In order to fill this gap, we present a corpus of original texts for these 3 languages, with high quality simplification produced by human experts in text simplification. It was developed within the iDEM project to assess the impact of Easy-to-Read (E2R) language for democratic participation. The original texts were compiled from domains related to this topic. The corpus includes different text types, selected based on relevance, copyright availability, and ethical…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsText Readability and Simplification · Wikis in Education and Collaboration · Authorship Attribution and Profiling
