LLM Agents Implement an NLG System from Scratch: Building Interpretable Rule-Based RDF-to-Text Generators
Mateusz Lango, Ond\v{r}ej Du\v{s}ek

TL;DR
This paper introduces a neurosymbolic framework where multiple LLM agents collaboratively generate rule-based RDF-to-text systems without supervised data, resulting in interpretable outputs with reduced hallucination and efficient generation.
Contribution
It presents a novel collaborative LLM-based approach for building interpretable RDF-to-text generators without supervised training data.
Findings
Reduces hallucination in generated text
Achieves near-instantaneous generation on CPU
Maintains fluency comparable to finetuned models
Abstract
We present a novel neurosymbolic framework for RDF-to-text generation, in which the model is "trained" through collaborative interactions among multiple LLM agents rather than traditional backpropagation. The LLM agents produce rule-based Python code for a generator for the given domain, based on RDF triples only, with no in-domain human reference texts. The resulting system is fully interpretable, requires no supervised training data, and generates text nearly instantaneously using only a single CPU. Our experiments on the WebNLG and OpenDialKG data show that outputs produced by our approach reduce hallucination, with only slight fluency penalties compared to finetuned or prompted language models
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications
