RDF2PT: Generating Brazilian Portuguese Texts from RDF Data
Diego Moussallem, Thiago Castro Ferreira, Marcos Zampieri, Maria, Claudia Cavalcanti, Geraldo Xex\'eo, Mariana Neves, Axel-Cyrille Ngonga Ngomo

TL;DR
RDF2PT is a novel method for generating natural Brazilian Portuguese texts from RDF data, filling a gap in multilingual data verbalization and demonstrating human-like quality in generated texts.
Contribution
This work introduces RDF2PT, the first approach to verbalize RDF data into Brazilian Portuguese, expanding multilingual natural language generation capabilities.
Findings
Generated texts are comparable to human-produced language
Participants found the texts easily understandable
RDF2PT effectively verbalizes RDF data in Brazilian Portuguese
Abstract
The generation of natural language from Resource Description Framework (RDF) data has recently gained significant attention due to the continuous growth of Linked Data. A number of these approaches generate natural language in languages other than English, however, no work has been proposed to generate Brazilian Portuguese texts out of RDF. We address this research gap by presenting RDF2PT, an approach that verbalizes RDF data to Brazilian Portuguese language. We evaluated RDF2PT in an open questionnaire with 44 native speakers divided into experts and non-experts. Our results suggest that RDF2PT is able to generate text which is similar to that generated by humans and can hence be easily understood.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Semantic Web and Ontologies
