Leveraging Large Language Models for Building Interpretable Rule-Based   Data-to-Text Systems

J\k{e}drzej Warczy\'nski; Mateusz Lango; Ondrej Dusek

arXiv:2502.20609·cs.CL·March 3, 2025

Leveraging Large Language Models for Building Interpretable Rule-Based Data-to-Text Systems

J\k{e}drzej Warczy\'nski, Mateusz Lango, Ondrej Dusek

PDF

TL;DR

This paper presents a method using large language models to automatically create interpretable rule-based data-to-text systems in Python, achieving better quality and efficiency than direct LLM prompts and fine-tuned models.

Contribution

The paper introduces a novel approach that leverages LLMs to automatically build fully interpretable rule-based systems, improving text quality and runtime efficiency.

Findings

01

Outperforms direct LLM prompts in text quality

02

Produces fewer hallucinations than fine-tuned BART

03

Operates faster with lower computational resources

Abstract

We introduce a simple approach that uses a large language model (LLM) to automatically implement a fully interpretable rule-based data-to-text system in pure Python. Experimental evaluation on the WebNLG dataset showed that such a constructed system produces text of better quality (according to the BLEU and BLEURT metrics) than the same LLM prompted to directly produce outputs, and produces fewer hallucinations than a BART language model fine-tuned on the same data. Furthermore, at runtime, the approach generates text in a fraction of the processing time required by neural approaches, using only a single CPU

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.