Impact of Model Size on Fine-tuned LLM Performance in Data-to-Text   Generation: A State-of-the-Art Investigation

Joy Mahapatra; Utpal Garain

arXiv:2407.14088·cs.CL·July 22, 2024

Impact of Model Size on Fine-tuned LLM Performance in Data-to-Text Generation: A State-of-the-Art Investigation

Joy Mahapatra, Utpal Garain

PDF

Open Access

TL;DR

This study investigates how the size of fine-tuned large language models affects their performance in data-to-text generation, revealing that larger models improve readability and informativeness but may reduce faithfulness, especially under source-reference divergence.

Contribution

It provides a comprehensive analysis of the impact of model size on D2T performance across multiple datasets, metrics, and LLM families, highlighting benefits and limitations of scaling.

Findings

01

Larger LLMs improve readability and informativeness.

02

Increased size may reduce faithfulness in generated text.

03

Small LLMs are more resilient to source-reference divergence.

Abstract

Data-to-text (D2T) generation aims to generate human-readable text from semi-structured data, such as tables and graphs. The recent success of D2T is largely attributed to advancements in LLMs. Despite the success of LLMs, no research has been conducted to illustrate the impact of model size on the performance of fine-tuned LLMs for D2T tasks. D2T model performance is typically assessed based on three key qualities: \textit{readability} (indicates fluency and coherence), \textit{informativeness} (measures content similarity), and \textit{faithfulness} (assesses consistency of factual information). It is currently uncertain whether increasing the size of LLMs effectively improves performance in D2T tasks across these three qualities. The objective of this study is to investigate the performance of fine-tuned LLMs in D2T tasks in terms of model size. Through extensive comparative…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Linear Layer · Softmax · Multi-Head Attention · Dense Connections · Residual Connection · Adam · Dropout · Byte Pair Encoding