Data2Concept2Text: An Explainable Multilingual Framework for Data Analysis Narration
Flavio Bertini (UNIPR), Alessandro Dal Pal\`u (UNIPR), Federica Zaglio, (UNIPR), Francesco Fabiano (NMSU), Andrea Formisano (UNIUD)

TL;DR
This paper introduces an explainable, multilingual system that interprets data by extracting concepts and translating them into natural language using logic-based rules, enhancing trustworthiness and accessibility in critical applications.
Contribution
It develops a transparent, rule-based framework for data-to-text generation that leverages logic programming and ontology reasoning, advancing explainability in natural language generation.
Findings
Demonstrates flexible multilingual natural language generation.
Shows the system can produce diverse, equivalent rewritings.
Highlights the importance of explainability in data interpretation.
Abstract
This paper presents a complete explainable system that interprets a set of data, abstracts the underlying features and describes them in a natural language of choice. The system relies on two crucial stages: (i) identifying emerging properties from data and transforming them into abstract concepts, and (ii) converting these concepts into natural language. Despite the impressive natural language generation capabilities demonstrated by Large Language Models, their statistical nature and the intricacy of their internal mechanism still force us to employ these techniques as black boxes, forgoing trustworthiness. Developing an explainable pipeline for data interpretation would allow facilitating its use in safety-critical environments like processing medical information and allowing non-experts and visually impaired people to access narrated information. To this end, we believe that the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsSparse Evolutionary Training · Focus
