P-TA: Using Proximal Policy Optimization to Enhance Tabular Data   Augmentation via Large Language Models

Shuo Yang; Chenchen Yuan; Yao Rong; Felix Steinbauer; Gjergji; Kasneci

arXiv:2406.11391·cs.LG·February 25, 2025

P-TA: Using Proximal Policy Optimization to Enhance Tabular Data Augmentation via Large Language Models

Shuo Yang, Chenchen Yuan, Yao Rong, Felix Steinbauer, Gjergji, Kasneci

PDF

Open Access 1 Video

TL;DR

This paper introduces a novel method combining Proximal Policy Optimization with Large Language Models to improve tabular data augmentation, resulting in more accurate synthetic data generation for better model training.

Contribution

The paper presents a new approach that integrates PPO with LLMs to enhance tabular data synthesis, addressing limitations of existing GAN and LLM methods.

Findings

01

PPO-guided LLMs improve data quality.

02

Achieved 4% accuracy increase on real-world datasets.

03

Outperforms state-of-the-art data augmentation methods.

Abstract

A multitude of industries depend on accurate and reasonable tabular data augmentation for their business processes. Contemporary methodologies in generating tabular data revolve around utilizing Generative Adversarial Networks (GAN) or fine-tuning Large Language Models (LLM). However, GAN-based approaches are documented to produce samples with common-sense errors attributed to the absence of external knowledge. On the other hand, LLM-based methods exhibit a limited capacity to capture the disparities between synthesized and actual data distribution due to the absence of feedback from a discriminator during training. Furthermore, the decoding of LLM-based generation introduces gradient breakpoints, impeding the backpropagation of loss from a discriminator, thereby complicating the integration of these two approaches. To solve this challenge, we propose using proximal policy optimization…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

P-TA: Using Proximal Policy Optimization to Enhance Tabular Data Augmentation via Large Language Models· underline

Taxonomy

TopicsData Quality and Management · Topic Modeling · Text Readability and Simplification

MethodsEntropy Regularization · Proximal Policy Optimization