LLM Agents Implement an NLG System from Scratch: Building Interpretable Rule-Based RDF-to-Text Generators

Mateusz Lango; Ondřej Dušek

arXiv:2512.18360·cs.CL·December 23, 2025

TL;DR

This paper introduces a neurosymbolic framework in which multiple LLM agents collaboratively write rule-based RDF-to-text generators without supervised data. The resulting systems are interpretable, hallucinate less, and generate text efficiently.

Contribution

It presents a novel collaborative LLM-based approach for building interpretable RDF-to-text generators without supervised training data.

Findings

01 Reduces hallucination in generated text compared to finetuned or prompted language models

02 Achieves near-instantaneous generation on a single CPU

03 Incurs only slight fluency penalties relative to finetuned or prompted language models

Abstract

We present a novel neurosymbolic framework for RDF-to-text generation, in which the model is "trained" through collaborative interactions among multiple LLM agents rather than traditional backpropagation. The LLM agents produce rule-based Python code for a generator for the given domain, based on RDF triples only, with no in-domain human reference texts. The resulting system is fully interpretable, requires no supervised training data, and generates text nearly instantaneously using only a single CPU. Our experiments on the WebNLG and OpenDialKG data show that outputs produced by our approach reduce hallucination, with only slight fluency penalties compared to finetuned or prompted language models.
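
To make the idea of "rule-based Python code for a generator" concrete, here is a minimal hand-written sketch of what such a generator could look like. It is not the code produced by the paper's agents: the names RULES, fallback_sentence and generate, the two example predicates, and the fallback wording are all assumptions chosen for illustration.

```python
import re
from typing import Callable, Dict, List, Tuple

# An RDF triple as (subject, predicate, object); a simplified, assumed format.
Triple = Tuple[str, str, str]


def clean(term: str) -> str:
    """Turn an entity label like 'Alan_Bean' into readable text."""
    return term.replace("_", " ").strip('"')


# Hypothetical per-predicate rules. In the paper these would be written by
# LLM agents; here they are written by hand purely as an illustration.
RULES: Dict[str, Callable[[str, str], str]] = {
    "birthPlace": lambda s, o: f"{s} was born in {o}.",
    "capital": lambda s, o: f"The capital of {s} is {o}.",
}


def fallback_sentence(s: str, p: str, o: str) -> str:
    """Generic rule for predicates without a dedicated template (assumed)."""
    # Split camelCase predicates: 'almaMater' -> 'alma mater'.
    words = re.sub(r"(?<=[a-z])(?=[A-Z])", " ", p).lower()
    return f"The {words} of {s} is {o}."


def generate(triples: List[Triple]) -> str:
    """Verbalize each triple with its rule and join the sentences."""
    sentences = []
    for subj, pred, obj in triples:
        s, o = clean(subj), clean(obj)
        rule = RULES.get(pred)
        sentences.append(rule(s, o) if rule else fallback_sentence(s, pred, o))
    return " ".join(sentences)


if __name__ == "__main__":
    print(generate([
        ("Alan_Bean", "birthPlace", "Wheeler,_Texas"),
        ("Alan_Bean", "almaMater", "UT_Austin"),
    ]))
```

Because every output sentence is traceable to a specific rule, a generator of this kind can only mention entities that appear in the input triples, and it runs with plain string operations on a CPU. These are the kinds of properties the abstract attributes to the approach.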

Taxonomy

Topics: Topic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications